Table of Contents
Fetching ...

Differential Privacy in the Shuffle Model: A Survey of Separations

Albert Cheu

TL;DR

This survey analyzes differential privacy in the shuffle model, positioning it as a practical middle ground that achieves high accuracy with modest trust by anonymizing user messages. It surveys concrete protocol families, lower bounds, and robustness considerations, highlighting how shuffle amplification can surpass local-model limits while not always matching central-model performance. It then explores interactivity as a promising direction, including sequential and fully interactive shuffle variants that can approach central-model capabilities under certain conditions. The open questions identify gaps in amplification vs removal results, optimal uniformity testing sample complexity, and the ultimate potential and limits of interactive shuffle protocols. Overall, the paper maps the state of shuffle-DP, clarifying when it yields substantial gains and where fundamental limits remain.

Abstract

Differential privacy is often studied in one of two models. In the central model, a single analyzer has the responsibility of performing a privacy-preserving computation on data. But in the local model, each data owner ensures their own privacy. Although it removes the need to trust the analyzer, local privacy comes at a price: a locally private protocol is less accurate than a centrally private counterpart when solving many learning and estimation problems. Protocols in the shuffle model are designed to attain the best of both worlds: recent work has shown high accuracy is possible with only a mild trust assumption. This survey paper gives an overview of novel shuffle protocols, along with lower bounds that establish the limits of the new model. We also summarize work that show the promise of interactivity in the shuffle model.

Differential Privacy in the Shuffle Model: A Survey of Separations

TL;DR

This survey analyzes differential privacy in the shuffle model, positioning it as a practical middle ground that achieves high accuracy with modest trust by anonymizing user messages. It surveys concrete protocol families, lower bounds, and robustness considerations, highlighting how shuffle amplification can surpass local-model limits while not always matching central-model performance. It then explores interactivity as a promising direction, including sequential and fully interactive shuffle variants that can approach central-model capabilities under certain conditions. The open questions identify gaps in amplification vs removal results, optimal uniformity testing sample complexity, and the ultimate potential and limits of interactive shuffle protocols. Overall, the paper maps the state of shuffle-DP, clarifying when it yields substantial gains and where fundamental limits remain.

Abstract

Differential privacy is often studied in one of two models. In the central model, a single analyzer has the responsibility of performing a privacy-preserving computation on data. But in the local model, each data owner ensures their own privacy. Although it removes the need to trust the analyzer, local privacy comes at a price: a locally private protocol is less accurate than a centrally private counterpart when solving many learning and estimation problems. Protocols in the shuffle model are designed to attain the best of both worlds: recent work has shown high accuracy is possible with only a mild trust assumption. This survey paper gives an overview of novel shuffle protocols, along with lower bounds that establish the limits of the new model. We also summarize work that show the promise of interactivity in the shuffle model.

Paper Structure

This paper contains 44 sections, 29 theorems, 22 equations, 6 tables, 2 algorithms.

Key Result

Lemma 2

Let $f \colon \mathcal{X}^n \to \mathbb{Z}$ be a 1-sensitive function, i.e. $|f(\vec{x}) - f(\vec{x}\,')| \leq 1$ for all neighboring datasets $\vec{x}, \vec{x}\,' \in \mathcal{X}^n$. There is a constant $\kappa$ such that, for any $\ell\in\mathbb{N}$, $p\in(0,1)$, and $\varepsilon, \delta \in (0,1) the algorithm that samples $\eta \sim \mathbf{Bin}(\ell, p)$ and outputs $f(\vec{x}) + \eta$ is $(\

Theorems & Definitions (48)

  • Definition 1: Differential Privacy DMNS06
  • Lemma 2: Binomial Mechanism DKMMN06GGK+19
  • Definition 3: Local Model Warner65EGS03
  • Definition 4: DP in the Local Model DMNS06KLNRS08
  • Definition 5: Shuffle Model BittauEMMRLRKTS17CSU+19
  • Remark 6
  • Definition 7: DP in the Shuffle Model CSU+19
  • Definition 8: Robust DP in the Shuffle Model
  • Theorem 9: Beimel et al. BNO08 & Chan et al. CSS12
  • Theorem 10: Cheu et al. CSU+19
  • ...and 38 more