Mixtures of Gaussians are Privately Learnable with a Polynomial Number of Samples
Mohammad Afzali, Hassan Ashtiani, Christopher Liaw
TL;DR
This paper provides the first finite sample complexity upper bound for privately learning general Gaussian Mixtures without restrictive structural assumptions. It introduces a general reduction: if a base class admits a $(t,2\alpha/15)$-locally small $\alpha/15$-cover in total variation and is list-decodable, then its $k$-mixtures are privately learnable with poly$(k,d,1/\alpha,1/\varepsilon,\log(1/\delta))$ samples. Key innovations include a private common-member selector (PCMS), a component-wise distance $\kappa_{mix}$ for mixtures, and a compression-based list-decoding approach for Gaussians, together enabling privately learning GMMs with polynomial sample complexity. A locally small cover for Gaussians is constructed via ball-cover techniques around $N(0,I_d)$ and TV-distance bounds, enabling the private learning of GMMs despite the lack of a TV-locally-small cover for mixtures. The results bridge private density estimation and mixture modeling, offering a principled pathway to DP-learning of complex distribution classes, albeit without a computationally efficient algorithm in this work.
Abstract
We study the problem of estimating mixtures of Gaussians under the constraint of differential privacy (DP). Our main result is that $\text{poly}(k,d,1/α,1/\varepsilon,\log(1/δ))$ samples are sufficient to estimate a mixture of $k$ Gaussians in $\mathbb{R}^d$ up to total variation distance $α$ while satisfying $(\varepsilon, δ)$-DP. This is the first finite sample complexity upper bound for the problem that does not make any structural assumptions on the GMMs. To solve the problem, we devise a new framework which may be useful for other tasks. On a high level, we show that if a class of distributions (such as Gaussians) is (1) list decodable and (2) admits a "locally small'' cover (Bun et al., 2021) with respect to total variation distance, then the class of its mixtures is privately learnable. The proof circumvents a known barrier indicating that, unlike Gaussians, GMMs do not admit a locally small cover (Aden-Ali et al., 2021b).
