Private Low-Rank Approximation for Covariance Matrices, Dyson Brownian Motion, and Eigenvalue-Gap Bounds for Gaussian Perturbations
Oren Mangoubi, Nisheeth K. Vishnoi
TL;DR
This paper studies privately releasing a rank-$k$ covariance approximation under $(\varepsilon,\delta)$-differential privacy by introducing a complex Gaussian mechanism and analyzing it through a matrix-Dyson diffusion lens. By treating Gaussian perturbations as a continuous-time Dyson Brownian motion, the authors derive SDEs for the evolving eigenvalues and eigenvectors and obtain a time-integrated bound on the Frobenius utility that scales like $\tilde{O}(\sqrt{kd})\,\frac{\sigma_k}{\sigma_k-\sigma_{k+1}}\,\frac{\sqrt{\log(1/\delta)}}{\varepsilon}$ under a sizeable $k$-th eigengap, outperforming previous results in gap-rich regimes. They further prove high-probability eigenvalue-gap bounds for GUE/GOE perturbations, showing that gaps between adjacent eigenvalues remain sizable with probability bounds of the form $\mathbb{P}(\eta_i-\eta_{i+1} \le c\,s/\sqrt{d}) \le s^{\beta+1}+d^{-1000}$, where $\beta=2$ (complex) or $1$ (real). The framework extends to private subspace recovery and provides insights into average-case perturbations and random-matrix eigenvalue gaps, with potential broader impact in privacy-preserving data analysis and numerical linear algebra under random noise. Overall, the work combines stochastic calculus, random matrix theory, and DP to yield tighter, structure-exploiting guarantees for private low-rank covariance approximation. Key techniques include dynamical eigenspace tracking via Dyson Brownian motion and carefully designed time-varying rank-$k$ projections that mitigate ill-conditioning from small eigen-gaps.
Abstract
We consider the problem of approximating a $d \times d$ covariance matrix $M$ with a rank-$k$ matrix under $(\varepsilon,δ)$-differential privacy. We present and analyze a complex variant of the Gaussian mechanism and obtain upper bounds on the Frobenius norm of the difference between the matrix output by this mechanism and the best rank-$k$ approximation to $M$. Our analysis provides improvements over previous bounds, particularly when the spectrum of $M$ satisfies natural structural assumptions. The novel insight is to view the addition of Gaussian noise to a matrix as a continuous-time matrix Brownian motion. This viewpoint allows us to track the evolution of eigenvalues and eigenvectors of the matrix, which are governed by stochastic differential equations discovered by Dyson. These equations enable us to upper bound the Frobenius distance between the best rank-$k$ approximation of $M$ and that of a Gaussian perturbation of $M$ as an integral that involves inverse eigenvalue gaps of the stochastically evolving matrix, as opposed to a sum of perturbation bounds obtained via Davis-Kahan-type theorems. Subsequently, again using the Dyson Brownian motion viewpoint, we show that the eigenvalues of the matrix $M$ perturbed by Gaussian noise have large gaps with high probability. These results also contribute to the analysis of low-rank approximations under average-case perturbations, and to an understanding of eigenvalue gaps for random matrices, both of which may be of independent interest.
