Mode-wise Principal Subspace Pursuit and Matrix Spiked Covariance Model
Runshi Tang, Ming Yuan, Anru R. Zhang
TL;DR
This work introduces Mode-wise Principal Subspace Pursuit (MOP-UP) to perform joint row- and column-wise dimensionality reduction on collections of matrices by leveraging matrix-variate spiked covariance models and their higher-order generalizations. The method combines Average Subspace Capture (ASC) for initialization with Alternating Projection (AP) to iteratively refine shared row and column subspaces, underpinned by a novel blockwise eigenspace perturbation bound that tightens perturbation analysis beyond classical results. Theoretical contributions include finite-sample error bounds for initialization and linear convergence of AP, plus a global high-probability theory for MOP-UP and its tensor generalizations. Empirically, MOP-UP demonstrates effective dimension reduction and improved downstream performance on MNIST and fMRI datasets, while offering a clear path to higher-order tensor extensions with identifiability guarantees. These results provide a versatile, scalable framework for covariance-aware dimension reduction in multi-subject matrix and tensor data.
Abstract
This paper introduces a novel framework called Mode-wise Principal Subspace Pursuit (MOP-UP) to extract hidden variations in both the row and column dimensions for matrix data. To enhance the understanding of the framework, we introduce a class of matrix-variate spiked covariance models that serve as inspiration for the development of the MOP-UP algorithm. The MOP-UP algorithm consists of two steps: Average Subspace Capture (ASC) and Alternating Projection (AP). These steps are specifically designed to capture the row-wise and column-wise dimension-reduced subspaces which contain the most informative features of the data. ASC utilizes a novel average projection operator as initialization and achieves exact recovery in the noiseless setting. We analyze the convergence and non-asymptotic error bounds of MOP-UP, introducing a blockwise matrix eigenvalue perturbation bound that proves the desired bound, where classic perturbation bounds fail. The effectiveness and practical merits of the proposed framework are demonstrated through experiments on both simulated and real datasets. Lastly, we discuss generalizations of our approach to higher-order data.
