Table of Contents
Fetching ...

The Limits of Differential Privacy in Online Learning

Bo Li, Wei Wang, Peng Ye

TL;DR

This work characterizes the limits of differential privacy in online learning, showing a sharp separation between no DP, pure DP, and approximate DP. It proves that approximate DP is necessary to handle adaptive adversaries, while pure DP forces the number of mistakes to grow at least logarithmically with the horizon for many classes, and even to infinity for infinite horizons in some cases. By linking learnability to the Littlestone dimension and representation dimension, the authors establish universal lower bounds $\Omega(\log T)$ and $\Omega(\mathrm{LD}(\mathcal{H}))\log T$, and they compare these with existing upper bounds, highlighting gaps and guiding future research on private online learning. The results have implications for private online inference in adversarial contexts and clarify the cost of privacy in sequential decision-making tasks.

Abstract

Differential privacy (DP) is a formal notion that restricts the privacy leakage of an algorithm when running on sensitive data, in which privacy-utility trade-off is one of the central problems in private data analysis. In this work, we investigate the fundamental limits of differential privacy in online learning algorithms and present evidence that separates three types of constraints: no DP, pure DP, and approximate DP. We first describe a hypothesis class that is online learnable under approximate DP but not online learnable under pure DP under the adaptive adversarial setting. This indicates that approximate DP must be adopted when dealing with adaptive adversaries. We then prove that any private online learner must make an infinite number of mistakes for almost all hypothesis classes. This essentially generalizes previous results and shows a strong separation between private and non-private settings since a finite mistake bound is always attainable (as long as the class is online learnable) when there is no privacy requirement.

The Limits of Differential Privacy in Online Learning

TL;DR

This work characterizes the limits of differential privacy in online learning, showing a sharp separation between no DP, pure DP, and approximate DP. It proves that approximate DP is necessary to handle adaptive adversaries, while pure DP forces the number of mistakes to grow at least logarithmically with the horizon for many classes, and even to infinity for infinite horizons in some cases. By linking learnability to the Littlestone dimension and representation dimension, the authors establish universal lower bounds and , and they compare these with existing upper bounds, highlighting gaps and guiding future research on private online learning. The results have implications for private online inference in adversarial contexts and clarify the cost of privacy in sequential decision-making tasks.

Abstract

Differential privacy (DP) is a formal notion that restricts the privacy leakage of an algorithm when running on sensitive data, in which privacy-utility trade-off is one of the central problems in private data analysis. In this work, we investigate the fundamental limits of differential privacy in online learning algorithms and present evidence that separates three types of constraints: no DP, pure DP, and approximate DP. We first describe a hypothesis class that is online learnable under approximate DP but not online learnable under pure DP under the adaptive adversarial setting. This indicates that approximate DP must be adopted when dealing with adaptive adversaries. We then prove that any private online learner must make an infinite number of mistakes for almost all hypothesis classes. This essentially generalizes previous results and shows a strong separation between private and non-private settings since a finite mistake bound is always attainable (as long as the class is online learnable) when there is no privacy requirement.

Paper Structure

This paper contains 28 sections, 23 theorems, 53 equations, 1 table, 8 algorithms.

Key Result

Lemma 3.2

There exists an $(\alpha, \beta)$-probabilistic representation for $\mathcal{H}$ with

Theorems & Definitions (45)

  • Definition 2.1: Shattered Tree
  • Definition 2.2: Littlestone Dimension
  • Definition 2.3: Differential Privacy
  • Definition 3.1: Representation Dimension
  • Lemma 3.2
  • Theorem 3.3
  • Theorem 3.4
  • Theorem 3.5
  • Corollary 3.6
  • Definition 4.1
  • ...and 35 more