DP-SGD with weight clipping
Antoine Barczewski, Jan Ramon
TL;DR
This paper addresses private training with differential privacy by moving beyond gradient clipping to a Lipschitz-based sensitivity bound. It introduces Lip-DP-SGD, which enforces Lipschitz constraints via ClipWeights and per-layer sensitivity estimates, enabling noise to be scaled without clipping biases. The approach delivers state-of-the-art accuracy under DP on image and tabular datasets and provides an open-source Lip-DP-SGD toolkit built on PyTorch/Opacus. The work demonstrates that weight clipping and Lipschitz control can substantially improve the privacy-utility trade-off and offers practical guidance for private deep learning. Overall, Lip-DP-SGD advances private training by reducing bias and leveraging per-layer Lipschitz bounds to set noise levels more precisely.
Abstract
Recently, due to the popularity of deep neural networks and other methods whose training typically relies on the optimization of an objective function, and due to concerns for data privacy, there is a lot of interest in differentially private gradient descent methods. To achieve differential privacy guarantees with a minimum amount of noise, it is important to be able to bound precisely the sensitivity of the information which the participants will observe. In this study, we present a novel approach that mitigates the bias arising from traditional gradient clipping. By leveraging a public upper bound of the Lipschitz value of the current model and its current location within the search domain, we can achieve refined noise level adjustments. We present a new algorithm with improved differential privacy guarantees and a systematic empirical evaluation, showing that our new approach outperforms existing approaches also in practice.
