Table of Contents
Fetching ...

Dimension-free Private Mean Estimation for Anisotropic Distributions

Yuval Dagan, Michael I. Jordan, Xuelin Yang, Lydia Zakynthinou, Nikita Zhivotovskiy

TL;DR

This work develops estimators that are appropriate for real-world data is often highly anisotropic, with signals concentrated on a small number of principal components and shows that this is the optimal sample complexity for this task up to logarithmic factors.

Abstract

We present differentially private algorithms for high-dimensional mean estimation. Previous private estimators on distributions over $\mathbb{R}^d$ suffer from a curse of dimensionality, as they require $Ω(d^{1/2})$ samples to achieve non-trivial error, even in cases where $O(1)$ samples suffice without privacy. This rate is unavoidable when the distribution is isotropic, namely, when the covariance is a multiple of the identity matrix, or when accuracy is measured with respect to the affine-invariant Mahalanobis distance. Yet, real-world data is often highly anisotropic, with signals concentrated on a small number of principal components. We develop estimators that are appropriate for such signals$\unicode{x2013}$our estimators are $(\varepsilon,δ)$-differentially private and have sample complexity that is dimension-independent for anisotropic subgaussian distributions. Given $n$ samples from a distribution with known covariance-proxy $Σ$ and unknown mean $μ$, we present an estimator $\hatμ$ that achieves error $\|\hatμ-μ\|_2\leq α$, as long as $n\gtrsim\mathrm{tr}(Σ)/α^2+ \mathrm{tr}(Σ^{1/2})/(α\varepsilon)$. In particular, when $\pmbσ^2=(σ_1^2, \ldots, σ_d^2)$ are the singular values of $Σ$, we have $\mathrm{tr}(Σ)=\|\pmbσ\|_2^2$ and $\mathrm{tr}(Σ^{1/2})=\|\pmbσ\|_1$, and hence our bound avoids dimension-dependence when the signal is concentrated in a few principal components. We show that this is the optimal sample complexity for this task up to logarithmic factors. Moreover, for the case of unknown covariance, we present an algorithm whose sample complexity has improved dependence on the dimension, from $d^{1/2}$ to $d^{1/4}$.

Dimension-free Private Mean Estimation for Anisotropic Distributions

TL;DR

This work develops estimators that are appropriate for real-world data is often highly anisotropic, with signals concentrated on a small number of principal components and shows that this is the optimal sample complexity for this task up to logarithmic factors.

Abstract

We present differentially private algorithms for high-dimensional mean estimation. Previous private estimators on distributions over suffer from a curse of dimensionality, as they require samples to achieve non-trivial error, even in cases where samples suffice without privacy. This rate is unavoidable when the distribution is isotropic, namely, when the covariance is a multiple of the identity matrix, or when accuracy is measured with respect to the affine-invariant Mahalanobis distance. Yet, real-world data is often highly anisotropic, with signals concentrated on a small number of principal components. We develop estimators that are appropriate for such signalsour estimators are -differentially private and have sample complexity that is dimension-independent for anisotropic subgaussian distributions. Given samples from a distribution with known covariance-proxy and unknown mean , we present an estimator that achieves error , as long as . In particular, when are the singular values of , we have and , and hence our bound avoids dimension-dependence when the signal is concentrated in a few principal components. We show that this is the optimal sample complexity for this task up to logarithmic factors. Moreover, for the case of unknown covariance, we present an algorithm whose sample complexity has improved dependence on the dimension, from to .

Paper Structure

This paper contains 22 sections, 19 theorems, 33 equations, 2 algorithms.

Key Result

Theorem 1.1

Any $\varepsilon$-DP algorithm which estimates the mean $\mu\in\mathcal{B}^d(R)$ (i.e., $\mu$ belongs to the Euclidean ball of radius $R$) of a Gaussian distribution up to constant accuracy requires $n= \Omega\mathopen{}\mathclose{\left(\frac{d\log(R)}{\varepsilon}\right)$ samples.

Theorems & Definitions (35)

  • Theorem 1.1: Pure DP Lower Bound, informal
  • Theorem 1.2: Upper bound, known covariance, informal
  • Theorem 1.3: Lower bound, informal
  • Theorem 1.4: Upper bound, unknown covariance, informal
  • Definition 2.1: $(\varepsilon,\delta)$-indistinguishability
  • Definition 2.2: Differential Privacy DworkMNS06
  • Definition 2.3: Laplace distribution
  • Definition 2.4: Laplace Mechanism, DworkMNS06
  • Lemma 2.5: DworkMNS06
  • Definition 2.6: Gaussian Mechanism, DworkMNS06
  • ...and 25 more