Spectral Perturbation Bounds for Low-Rank Approximation with Applications to Privacy
Phuc Tran, Nisheeth K. Vishnoi, Van H. Vu
TL;DR
This work develops high-probability spectral-norm perturbation bounds for the top-$p$ rank-$p$ approximation of symmetric matrices under symmetric perturbations, refining the classical Eckart–Young–Mirsky framework. The authors introduce contour bootstrapping, a complex-analytic approach that represents spectral projectors via contour integrals and yields two main bounds: a basic bound $O\left( \|E\| \frac{\lambda_p}{\delta_p} \right)$ and a refined interaction-aware bound $\tilde{O}\left( \|E\| + r^2 x \frac{\lambda_p}{\delta_p} \right)$, accounting for eigenspace alignment through $x$ and spectral decay via the halving distance $r$. The framework extends to general spectral functionals $f(A)$ and provides improved differentially private PCA guarantees with high-probability spectral-norm bounds, surpassing prior Frobenius-based results. Empirical results on real covariance matrices corroborate the sharpness of the bounds across perturbation regimes and noise models, underscoring their practical impact for privacy-preserving low-rank analysis.
Abstract
A central challenge in machine learning is to understand how noise or measurement errors affect low-rank approximations, particularly in the spectral norm. This question is especially important in differentially private low-rank approximation, where one aims to preserve the top-$p$ structure of a data-derived matrix while ensuring privacy. Prior work often analyzes Frobenius norm error or changes in reconstruction quality, but these metrics can over- or under-estimate true subspace distortion. The spectral norm, by contrast, captures worst-case directional error and provides the strongest utility guarantees. We establish new high-probability spectral-norm perturbation bounds for symmetric matrices that refine the classical Eckart--Young--Mirsky theorem and explicitly capture interactions between a matrix $A \in \mathbb{R}^{n \times n}$ and an arbitrary symmetric perturbation $E$. Under mild eigengap and norm conditions, our bounds yield sharp estimates for $\|(A + E)_p - A_p\|$, where $A_p$ is the best rank-$p$ approximation of $A$, with improvements of up to a factor of $\sqrt{n}$. As an application, we derive improved utility guarantees for differentially private PCA, resolving an open problem in the literature. Our analysis relies on a novel contour bootstrapping method from complex analysis and extends it to a broad class of spectral functionals, including polynomials and matrix exponentials. Empirical results on real-world datasets confirm that our bounds closely track the actual spectral error under diverse perturbation regimes.
