Provable Model-Parallel Distributed Principal Component Analysis with Parallel Deflation
Fangshuo Liao, Wenyi Su, Anastasios Kyrillidis
TL;DR
This work tackles scalable distributed PCA by introducing Parallel Deflation, a model-parallel framework where each worker computes a distinct principal component in parallel using intermediate updates from peers. By reformulating deflation into an iterative, multi-round procedure, the method eliminates strict sequential dependencies and supports asynchronous communication with modest overhead. The authors prove a convergence guarantee under a contraction assumption for the Top1 subroutine, showing nearly-linear convergence after a data-dependent warmup via a Davis–Kahan based analysis. Empirical results on synthetic data, MNIST, and ImageNet demonstrate competitive performance with the state-of-the-art EigenGame variants and confirm scalability to large-scale datasets.
Abstract
We study a distributed Principal Component Analysis (PCA) framework where each worker targets a distinct eigenvector and refines its solution by updating from intermediate solutions provided by peers deemed as "superior". Drawing intuition from the deflation method in centralized eigenvalue problems, our approach breaks the sequential dependency in the deflation steps and allows asynchronous updates of workers, while incurring only a small communication cost. To our knowledge, a gap in the literature -- the theoretical underpinning of such distributed, dynamic interactions among workers -- has remained unaddressed. This paper offers a theoretical analysis explaining why, how, and when these intermediate, hierarchical updates lead to practical and provable convergence in distributed environments. Despite being a theoretical work, our prototype implementation demonstrates that such a distributed PCA algorithm converges effectively and in scalable way: through experiments, our proposed framework offers comparable performance to EigenGame-$μ$, the state-of-the-art model-parallel PCA solver.
