Sparse Principal Component Analysis with Energy Profile Dependent Sample Complexity
Mengchu Xu, Jian Wang, Yonina C. Eldar
TL;DR
The paper tackles sparse PCA in high-dimensional, sample-scarce regimes where spike energy is non-uniform. It introduces Spectral Energy Pursuit (SEP), an iterative, computationally efficient method that leverages a structure function s(p) to adapt to the spike’s energy profile, achieving a sample complexity of m ≈ max_{1≤p≤k} p s^2(p) log n in the worst case and improving toward k log n as energy concentrates. A lightweight post-processing step using a single iteration of the truncated power method with a centered operator further guarantees a uniform statistical error bound. Empirical results across flat, power-law, and exponential signals demonstrate SEP’s ability to adapt without tuning and to outperform existing approaches, especially on non-flat profiles.
Abstract
We study sparse principal component analysis in the high-dimensional, sample-limited regime, aiming to recover a leading component supported on a few coordinates. Despite extensive progress, most methods and analyses are tailored to the flat-spike case, offering little guidance when spike energy is unevenly distributed across the support. Motivated by this, we propose Spectral Energy Pursuit (SEP), an effective iterative scheme that repeatedly screens and reselects coordinates, with a sample complexity that adapts to the energy profile. We develop our framework around a structure function \(s(p)\) that quantifies how spike energy accumulates over its top \(p\) entries. We establish that SEP succeeds with a sample size of order \(\max_{1\le p\le k} p\,s^2(p)\,\log n\), which matches the classical \(k^2\log n\) sample complexity for flat spikes and improves toward the \(k\log n\) regime as the profile becomes more concentrated. As a lightweight post-processing, a single truncated power iteration is proven to enable the final estimator to attain a uniform statistical error bound. Empirical simulations across flat, power-law, and exponential signals validate that SEP adapts to profile structure without tuning and outperforms existing algorithms.
