How Private are DP-SGD Implementations?
Lynn Chua, Badih Ghazi, Pritish Kamath, Ravi Kumar, Pasin Manurangsi, Amer Sinha, Chiyuan Zhang
TL;DR
The work investigates how batch sampling choices in DP-SGD, framed through Adaptive Batch Linear Queries, influence privacy guarantees with δ_B(ε) curves for deterministic, Poisson, and shuffle batchers. By employing dominating pairs and hockey-stick divergence, it shows that shuffle batching provides always stronger privacy than deterministic batching, but Poisson subsampling can be either more or less private than deterministic batching depending on ε, and shuffle amplification can be substantially weaker than expected. The authors derive closed-form privacy bounds for deterministic and Poisson cases, discuss the absence of a proven tightly dominating pair for shuffle, and provide numerical evidence using PLD/RDP accounting to illustrate the potential misreporting risks when equating shuffle with Poisson analyses. The results highlight that batch sampler choice materially affects reported DP-SGD privacy, underscoring the need for careful, sampler-specific privacy accounting in practice and motivating further work on tight shuffle-based DP analysis and multi-epoch settings.
Abstract
We demonstrate a substantial gap between the privacy guarantees of the Adaptive Batch Linear Queries (ABLQ) mechanism under different types of batch sampling: (i) Shuffling, and (ii) Poisson subsampling; the typical analysis of Differentially Private Stochastic Gradient Descent (DP-SGD) follows by interpreting it as a post-processing of ABLQ. While shuffling-based DP-SGD is more commonly used in practical implementations, it has not been amenable to easy privacy analysis, either analytically or even numerically. On the other hand, Poisson subsampling-based DP-SGD is challenging to scalably implement, but has a well-understood privacy analysis, with multiple open-source numerically tight privacy accountants available. This has led to a common practice of using shuffling-based DP-SGD in practice, but using the privacy analysis for the corresponding Poisson subsampling version. Our result shows that there can be a substantial gap between the privacy analysis when using the two types of batch sampling, and thus advises caution in reporting privacy parameters for DP-SGD.
