Table of Contents
Fetching ...

Private Realizable-to-Agnostic Transformation with Near-Optimal Sample Complexity

Bo Li, Wei Wang, Peng Ye

TL;DR

An improved construction is given that eliminates the dependence on $\varepsilon$ and achieves a near-optimal extra sample complexity of $\widetilde{O}(\mathrm{VC}(\mathcal{C})/\alpha^2)$ for any $\varepsilon\le 1$.

Abstract

The realizable-to-agnostic transformation (Beimel et al., 2015; Alon et al., 2020) provides a general mechanism to convert a private learner in the realizable setting (where the examples are labeled by some function in the concept class) to a private learner in the agnostic setting (where no assumptions are imposed on the data). Specifically, for any concept class $\mathcal{C}$ and error parameter $α$, a private realizable learner for $\mathcal{C}$ can be transformed into a private agnostic learner while only increasing the sample complexity by $\widetilde{O}(\mathrm{VC}(\mathcal{C})/α^2)$, which is essentially tight assuming a constant privacy parameter $\varepsilon = Θ(1)$. However, when $\varepsilon$ can be arbitrary, one has to apply the standard privacy-amplification-by-subsampling technique (Kasiviswanathan et al., 2011), resulting in a suboptimal extra sample complexity of $\widetilde{O}(\mathrm{VC}(\mathcal{C})/α^2\varepsilon)$ that involves a $1/\varepsilon$ factor. In this work, we give an improved construction that eliminates the dependence on $\varepsilon$, thereby achieving a near-optimal extra sample complexity of $\widetilde{O}(\mathrm{VC}(\mathcal{C})/α^2)$ for any $\varepsilon\le 1$. Moreover, our result reveals that in private agnostic learning, the privacy cost is only significant for the realizable part. We also leverage our technique to obtain a nearly tight sample complexity bound for the private prediction problem, resolving an open question posed by Dwork and Feldman (2018) and Dagan and Feldman (2020).

Private Realizable-to-Agnostic Transformation with Near-Optimal Sample Complexity

TL;DR

An improved construction is given that eliminates the dependence on and achieves a near-optimal extra sample complexity of for any .

Abstract

The realizable-to-agnostic transformation (Beimel et al., 2015; Alon et al., 2020) provides a general mechanism to convert a private learner in the realizable setting (where the examples are labeled by some function in the concept class) to a private learner in the agnostic setting (where no assumptions are imposed on the data). Specifically, for any concept class and error parameter , a private realizable learner for can be transformed into a private agnostic learner while only increasing the sample complexity by , which is essentially tight assuming a constant privacy parameter . However, when can be arbitrary, one has to apply the standard privacy-amplification-by-subsampling technique (Kasiviswanathan et al., 2011), resulting in a suboptimal extra sample complexity of that involves a factor. In this work, we give an improved construction that eliminates the dependence on , thereby achieving a near-optimal extra sample complexity of for any . Moreover, our result reveals that in private agnostic learning, the privacy cost is only significant for the realizable part. We also leverage our technique to obtain a nearly tight sample complexity bound for the private prediction problem, resolving an open question posed by Dwork and Feldman (2018) and Dagan and Feldman (2020).

Paper Structure

This paper contains 9 sections, 15 theorems, 61 equations, 3 algorithms.

Key Result

Theorem 1.1

An $(\varepsilon, \delta)$-differentially private realizable learner for $\mathcal{C}$ with error $\alpha$ and with sample complexity $m$ can be transformed into an $(\varepsilon,\delta)$-differentially private agnostic learner for $\mathcal{C}$ with excess error $O(\alpha)$ and with sample complexi

Theorems & Definitions (30)

  • Theorem 1.1: Informal Version of Theorem \ref{['thm:trans']}
  • Theorem 1.2
  • Definition 2.1: PAC Learning valiant1984theory
  • Definition 2.2: Agnostic Learning
  • Definition 2.3: The Growth Function
  • Lemma 2.4: Sauer's Lemma
  • Lemma 2.5: Realizable Generalization Bound
  • Lemma 2.6: Agnostic Generalization Bound
  • Definition 2.7: Differential Privacy dwork2006calibratingdwork2006our
  • Definition 2.8: Private Prediction dwork2018privacy
  • ...and 20 more