Table of Contents
Fetching ...

Not All Learnable Distribution Classes are Privately Learnable

Mark Bun, Gautam Kamath, Argyris Mouzakis, Vikrant Singhal

TL;DR

This work demonstrates a fundamental limit: not every learnable class of distributions is privately learnable under approximate differential privacy. By constructing a trapdoor distribution class built as a shared-parameter mixture, the authors show there exists an algorithm that can learn up to a constant TV error from a constant number of samples, yet any $(\varepsilon,\delta)$-DP learner achieving the same accuracy requires infinitely many samples as the dimension grows. The core technique is a reduction from private mean estimation of binary product distributions to density estimation over the class $\mathcal{H}_{\frac{\alpha}{2},d}$, leveraging a two-component mixture where the first component serves as a key to reveal the full parameter vector. This establishes a sharp separation between non-private learnability and private learnability, providing a concrete counterexample to Ashtiani's conjecture and highlighting intrinsic limits of private distribution learning with approximate DP.

Abstract

We give an example of a class of distributions that is learnable up to constant error in total variation distance with a finite number of samples, but not learnable under $(\varepsilon, δ)$-differential privacy with the same target error. This weakly refutes a conjecture of Ashtiani.

Not All Learnable Distribution Classes are Privately Learnable

TL;DR

This work demonstrates a fundamental limit: not every learnable class of distributions is privately learnable under approximate differential privacy. By constructing a trapdoor distribution class built as a shared-parameter mixture, the authors show there exists an algorithm that can learn up to a constant TV error from a constant number of samples, yet any -DP learner achieving the same accuracy requires infinitely many samples as the dimension grows. The core technique is a reduction from private mean estimation of binary product distributions to density estimation over the class , leveraging a two-component mixture where the first component serves as a key to reveal the full parameter vector. This establishes a sharp separation between non-private learnability and private learnability, providing a concrete counterexample to Ashtiani's conjecture and highlighting intrinsic limits of private distribution learning with approximate DP.

Abstract

We give an example of a class of distributions that is learnable up to constant error in total variation distance with a finite number of samples, but not learnable under -differential privacy with the same target error. This weakly refutes a conjecture of Ashtiani.
Paper Structure (4 sections, 2 theorems, 2 equations)

This paper contains 4 sections, 2 theorems, 2 equations.

Key Result

Theorem 1.2

There exists a class of distributions $\mathcal{H}$ such that, for an absolute constant $c$:

Theorems & Definitions (3)

  • Theorem 1.2: Informal version of Theorem \ref{['thm:main_theorem_formal']}
  • Definition 2.1: Differential Privacy (DP) DworkMNS06
  • Lemma 2.2: Post Processing DworkMNS06