$f$-Differential Privacy Filters: Validity and Approximate Solutions
Long Tran, Antti Koskela, Ossi Räisä, Antti Honkela
TL;DR
This work investigates privacy accounting under fully adaptive differential privacy using $f$-DP, proving that the natural filter based on composing trade-off functions and stopping at a budget is not valid in general. It identifies a structural condition—Blackwell chains—under which such a filter becomes valid and shows this holds for Gaussian (GDP) trade-offs but not universally for subsampled Gaussian mechanisms. The authors then develop a fully adaptive central limit theorem for privacy-loss processes and construct an approximate GDP filter tailored to DP-SGD, yielding tighter guarantees than fully adaptive RDP in regimes where the sampling rate is very small or very large. The results illuminate when tensor-product based accounting can be effective and provide a practical, provably tighter privacy filter for adaptive machine-learning workflows, with implications for mechanism-specific privacy accounting.
Abstract
Accounting for privacy loss under fully adaptive composition -- where both the choice of mechanisms and their privacy parameters may depend on the entire history of prior outputs -- is a central challenge in differential privacy (DP). In this setting, privacy filters are stopping rules for compositions that ensure a prescribed global privacy budget is not exceeded. It remains unclear whether optimal trade-off-function-based notions, such as $f$-DP, admit valid privacy filters under fully adaptive interaction. We show that the natural approach to defining an $f$-DP filter -- composing individual trade-off curves and stopping when the prescribed $f$-DP curve is crossed -- is fundamentally invalid. We characterise when and why this failure occurs, and establish necessary and sufficient conditions under which the natural filter is valid. Furthermore, we prove a fully adaptive central limit theorem for $f$-DP and construct an approximate Gaussian DP filter for subsampled Gaussian mechanisms at small sampling rates $q<0.2$ and large sampling rates $q>0.8$, yielding tighter privacy guarantees than filters based on Rényi DP in the same setting.
