Sequentially Auditing Differential Privacy
Tomás González, Mateo Dulce-Rubio, Aaditya Ramdas, Mónica Ribero
TL;DR
This work develops a sequential, anytime-valid framework for auditing differential privacy guarantees of black-box mechanisms using kernel-based Maximum Mean Discrepancy (MMD) and e-value concepts. The authors derive a general MMD-based one-sided sequential test with provable Type I error control and exponential growth under the alternative, and provide a practical instantiation using Online Newton Step and Online Gradient Ascent to learn the witness and tuning parameter. They prove a tighter MMD-DP bound and demonstrate substantial improvements in sample efficiency, enabling DP violation detection with as few as a few hundred samples, compared to batch methods requiring tens to hundreds of thousands. Empirical results on additive-noise mechanisms and DP-SGD show rapid detection of privacy breaches and the ability to infer empirical privacy bounds during training, highlighting the method’s practical impact for rapid, resource-efficient privacy auditing and verification.
Abstract
We propose a practical sequential test for auditing differential privacy guarantees of black-box mechanisms. The test processes streams of mechanisms' outputs providing anytime-valid inference while controlling Type I error, overcoming the fixed sample size limitation of previous batch auditing methods. Experiments show this test detects violations with sample sizes that are orders of magnitude smaller than existing methods, reducing this number from 50K to a few hundred examples, across diverse realistic mechanisms. Notably, it identifies DP-SGD privacy violations in \textit{under} one training run, unlike prior methods needing full model training.
