Balls-and-Bins Sampling for DP-SGD
Lynn Chua, Badih Ghazi, Charlie Harrison, Ethan Leeman, Pritish Kamath, Ravi Kumar, Pasin Manurangsi, Amer Sinha, Chiyuan Zhang
TL;DR
The paper proposes Balls-and-Bins sampling as a DP-SGD batch generator that mirrors Shuffle in implementation while preserving Poisson-like batch marginals, enabling favorable privacy amplification without sacrificing utility. By identifying a tightly dominating pair $(P_{\mathcal{B}}, Q_{\mathcal{B}})$ for the ABLQ$\_ {\mathcal{B}}$ mechanism, it provides a rigorous DP characterization and demonstrates improved privacy guarantees over deterministic and shuffle batching, with practical parity to Poisson subsampling in many regimes. To make privacy accounting tractable, the authors develop importance-sampling and order-statistics sampling techniques for Monte Carlo estimation of $\delta_{\mathcal{B}}(\varepsilon)$, including lower bounds, and validate these methods on large-scale datasets where Balls-and-Bins attains competitive utility. The work lays out both practical benefits and several open questions, such as tight DP accounting for $\mathcal{ABLQ}_{\mathcal{B}}$ and extensions to multi-epoch training, while aligning with concurrent results that link Balls-and-Bins to non-asymptotic privacy guarantees. Overall, the approach offers a compelling path to robust DP-SGD with Shuffle-like practicality and strong privacy amplification.
Abstract
We introduce the Balls-and-Bins sampling for differentially private (DP) optimization methods such as DP-SGD. While it has been common practice to use some form of shuffling in DP-SGD implementations, privacy accounting algorithms have typically assumed that Poisson subsampling is used instead. Recent work by Chua et al. (ICML 2024), however, pointed out that shuffling based DP-SGD can have a much larger privacy cost in practical regimes of parameters. In this work we show that the Balls-and-Bins sampling achieves the "best-of-both" samplers, namely, the implementation of Balls-and-Bins sampling is similar to that of Shuffling and models trained using DP-SGD with Balls-and-Bins sampling achieve utility comparable to those trained using DP-SGD with Shuffling at the same noise multiplier, and yet, Balls-and-Bins sampling enjoys similar-or-better privacy amplification as compared to Poisson subsampling in practical regimes.
