Privacy Amplification for the Gaussian Mechanism via Bounded Support
Shengyuan Hu, Saeed Mahloujifar, Virginia Smith, Kamalika Chaudhuri, Chuan Guo
TL;DR
The paper tackles the conservatism of worst-case differential privacy by leveraging data-dependent accounting frameworks (pDP and FIL). It introduces Gaussian mechanisms with bounded support—specifically rectified and truncated Gaussian—and analyzes their privacy amplification under both FIL and per-instance Rényi DP, demonstrating that the amplification is stronger in the tails of the input. The authors derive closed-form expressions for FIL and RDP for these bounded mechanisms, establish coordinate-wise tensorization to handle high-dimensional data, and discuss subsampling challenges. Empirical results on private SGD show meaningful privacy-cost reductions (up to $>30\%$ in some settings) with little to no loss in utility on benchmarks like CIFAR-10/100, validating the practical value of data-dependent privacy amplification via bounded noise. The work suggests a promising direction for tighter, data-aware privacy guarantees in real-world ML systems and highlights future avenues such as per-coordinate subsampling and deeper integration with compression-inspired methods.
Abstract
Data-dependent privacy accounting frameworks such as per-instance differential privacy (pDP) and Fisher information loss (FIL) confer fine-grained privacy guarantees for individuals in a fixed training dataset. These guarantees can be desirable compared to vanilla DP in real world settings as they tightly upper-bound the privacy leakage for a $\textit{specific}$ individual in an $\textit{actual}$ dataset, rather than considering worst-case datasets. While these frameworks are beginning to gain popularity, to date, there is a lack of private mechanisms that can fully leverage advantages of data-dependent accounting. To bridge this gap, we propose simple modifications of the Gaussian mechanism with bounded support, showing that they amplify privacy guarantees under data-dependent accounting. Experiments on model training with DP-SGD show that using bounded support Gaussian mechanisms can provide a reduction of the pDP bound $ε$ by as much as 30% without negative effects on model utility.
