Auditing Private Prediction

Karan Chadha; Matthew Jagielski; Nicolas Papernot; Christopher Choquette-Choo; Milad Nasr

Auditing Private Prediction

Karan Chadha, Matthew Jagielski, Nicolas Papernot, Christopher Choquette-Choo, Milad Nasr

TL;DR

This work introduces the first auditing framework for private prediction, extending differential privacy auditing to inference-time privacy leakage. By modeling per-query Renyi DP and employing exact Renyi divergence calculations for noisy-argmax across histograms, the authors derive informative lower and upper bounds and apply them to four private-prediction algorithms: PATE, CaPC, PromptPATE, and Private-kNN. They design adversaries with varying poisoning and query capabilities (Nat-MIAQ, Pois-MIAQ, Pois-MINQ) and demonstrate that leakage grows with poisoning ability and query control, while data-dependent analyses can tighten or loosen bounds. The framework enables attribution of leakage to algorithm design and adversarial capabilities, offering a practical tool for assessing privacy of private-prediction systems in real-world tasks such as LLM task adaptation. The results indicate substantial room to improve privacy analyses and suggest directions for stronger attacks, distribution-aware auditing, and leveraging partitioning randomness to refine bounds.

Abstract

Differential privacy (DP) offers a theoretical upper bound on the potential privacy leakage of analgorithm, while empirical auditing establishes a practical lower bound. Auditing techniques exist forDP training algorithms. However machine learning can also be made private at inference. We propose thefirst framework for auditing private prediction where we instantiate adversaries with varying poisoningand query capabilities. This enables us to study the privacy leakage of four private prediction algorithms:PATE [Papernot et al., 2016], CaPC [Choquette-Choo et al., 2020], PromptPATE [Duan et al., 2023],and Private-kNN [Zhu et al., 2020]. To conduct our audit, we introduce novel techniques to empiricallyevaluate privacy leakage in terms of Renyi DP. Our experiments show that (i) the privacy analysis ofprivate prediction can be improved, (ii) algorithms which are easier to poison lead to much higher privacyleakage, and (iii) the privacy leakage is significantly lower for adversaries without query control than thosewith full control.

Auditing Private Prediction

TL;DR

Abstract

Paper Structure (56 sections, 1 theorem, 16 equations, 31 figures, 2 tables, 1 algorithm)

This paper contains 56 sections, 1 theorem, 16 equations, 31 figures, 2 tables, 1 algorithm.

Introduction
Related Work
Notation and Preliminaries
Differential Privacy
Auditing Differential Privacy
Attack Interpretation.
Private Prediction
Data Independent.
Data Dependent.
Auditing Private Prediction
Adversaries
$\textbf{Nat-MIAQ}$.
$\textbf{Pois-MIAQ}$.
$\textbf{Pois-MINQ}$.
Auditing RDP of Noisy Argmax
...and 41 more sections

Key Result

Theorem 1

balle2020hypothesis If an algorithm is $(\alpha,\varepsilon_{\alpha})$-RDP, then it is also $(\varepsilon_{\alpha} + \log(\frac{\alpha - 1}{\alpha}) - \frac{\log{\delta} + \log{\alpha}}{\alpha - 1},\delta)$-DP for any $\delta \in (0,1)$.

Figures (31)

Figure 1: Framework to audit private prediction algorithms.
Figure 2: RDP audit for noisy argmax
Figure 3: Privacy leakage of PATE, CaPC and PromptPATE for adversaries with adversarial query capability.
Figure 4: Privacy leakage of PATE, CaPC and PromptPATE for adversaries restricted to making natural queries.
Figure 5: Privacy leakage for Private-kNN
...and 26 more figures

Theorems & Definitions (4)

Definition 1: Approximate DP
Definition 2: Renyi DP
Theorem 1
Definition 3: Private Prediction Interface

Auditing Private Prediction

TL;DR

Abstract

Auditing Private Prediction

Authors

TL;DR

Abstract

Table of Contents

Key Result

Figures (31)

Theorems & Definitions (4)