Private PAC Learning May be Harder than Online Learning

Mark Bun; Aloni Cohen; Rathin Desai

Private PAC Learning May be Harder than Online Learning

Mark Bun, Aloni Cohen, Rathin Desai

TL;DR

This work investigates the computational limits of privately PAC learning and its relation to online learning via Littlestone dimension. It constructs a leakage-based framework (EncThr/LEncThr) using a function-revealing encryption scheme with a leakage tfld that supports an efficient online learner but blocks polynomial-time private PAC learning under plausible cryptographic assumptions. The authors prove an explicit separation: LEncThr is efficiently online learnable with a polynomial mistake bound, yet there is no polynomial-time DP-PAC learner for LEncThr when tfld leakage is log-invariant. This establishes that polynomial-time online learnability and polynomial-time private PAC learnability are incomparable, informing the design and limits of privacy-preserving learning methods with cryptographic hardness implications.

Abstract

We continue the study of the computational complexity of differentially private PAC learning and how it is situated within the foundations of machine learning. A recent line of work uncovered a qualitative equivalence between the private PAC model and Littlestone's mistake-bounded model of online learning, in particular, showing that any concept class of Littlestone dimension $d$ can be privately PAC learned using $\mathrm{poly}(d)$ samples. This raises the natural question of whether there might be a generic conversion from online learners to private PAC learners that also preserves computational efficiency. We give a negative answer to this question under reasonable cryptographic assumptions (roughly, those from which it is possible to build indistinguishability obfuscation for all circuits). We exhibit a concept class that admits an online learner running in polynomial time with a polynomial mistake bound, but for which there is no computationally-efficient differentially private PAC learner. Our construction and analysis strengthens and generalizes that of Bun and Zhandry (TCC 2016-A), who established such a separation between private and non-private PAC learner.

Private PAC Learning May be Harder than Online Learning

TL;DR

Abstract

can be privately PAC learned using

samples. This raises the natural question of whether there might be a generic conversion from online learners to private PAC learners that also preserves computational efficiency. We give a negative answer to this question under reasonable cryptographic assumptions (roughly, those from which it is possible to build indistinguishability obfuscation for all circuits). We exhibit a concept class that admits an online learner running in polynomial time with a polynomial mistake bound, but for which there is no computationally-efficient differentially private PAC learner. Our construction and analysis strengthens and generalizes that of Bun and Zhandry (TCC 2016-A), who established such a separation between private and non-private PAC learner.

Paper Structure (25 sections, 25 theorems, 31 equations, 1 figure, 3 algorithms)

This paper contains 25 sections, 25 theorems, 31 equations, 1 figure, 3 algorithms.

Introduction
Techniques
Preliminaries
PAC Learning
Differential Privacy
Online Learning, Halving, and Thresholds
Concept Class and its Learnability
Computational Separation between PAC and Private PAC learning
Function Revealing Encryption
Correctness.
On perfectly and strongly correct evaluation.
Properties of leakage function for separation
Sufficient Leakage for Online Learning
Sufficient condition for Hardness of Private Learning
Efficient Online Learner
...and 10 more sections

Key Result

theorem 2

Let $\mathcal{F}$ be a concept class with Littlestone dimension$d = L(\mathcal{F})$. Then $\tilde{O}(d^6)$ samples are sufficient to privately learn $\mathcal{F}$ and $\Omega(\log^* d)$ samples are necessary.

Figures (1)

Figure 1: Sketch of construction

Theorems & Definitions (46)

theorem 2: AlonBLMM22GhaziGKM21
theorem 3: Informal
definition 4
definition 5: PAC Learning, Valiant84
definition 6: Efficient PAC Learning
definition 7
definition 8: Differential Privacy, DworkMNS06DworkKMMN06
lemma 9: Basic Composition DworkMNS06DworkL09
lemma 10: Group Privacy
lemma 11: Post-Processing
...and 36 more

Private PAC Learning May be Harder than Online Learning

TL;DR

Abstract

Private PAC Learning May be Harder than Online Learning

Authors

TL;DR

Abstract

Table of Contents

Key Result

Figures (1)

Theorems & Definitions (46)