Lower Bounds for Private Estimation of Gaussian Covariance Matrices under All Reasonable Parameter Regimes
Victor S. Portella, Nick Harvey
TL;DR
This paper studies the problem of estimating the covariance of a Gaussian under $(\varepsilon,\delta)$-DP and proves a matching $n=\Omega\left(\frac{d^{2}}{\varepsilon\alpha}\right)$ lower bound for well-conditioned covariances, thereby closing a gap in the parameter regimes where previous work had gaps. The authors extend the score-attack fingerprinting framework to non-i.i.d. priors by leveraging the Stein-Haff identity, and they design a natural prior over $\Sigma$ via a normalized Wishart distribution with $D=2d$ that yields robust, polylog-free lower bounds. The analysis combines a lower bound on a correlation statistic (via Stein-Haff) with an upper bound derived from DP properties and tail bounds, showing that private covariance estimation requires essentially the same sample complexity as the non-private setting up to privacy terms. A key technical contribution is handling non-independent parameters in the score-attack approach, enabling lower bounds for a broad class of priors and parameter regimes. The results provide a unified, rigorous view of DP covariance estimation lower bounds and offer a methodology that may extend to other parameter estimation problems under privacy constraints.
Abstract
We prove lower bounds on the number of samples needed to privately estimate the covariance matrix of a Gaussian distribution. Our bounds match existing upper bounds in the widest known setting of parameters. Our analysis relies on the Stein-Haff identity, an extension of the classical Stein's identity used in previous fingerprinting lemma arguments.
