Learning with Differentially Private (Sliced) Wasserstein Gradients
David Rodríguez-Vítores, Clément Lalanne, Jean-Michel Loubes
TL;DR
This work tackles private learning for objectives based on Wasserstein distances between empirical distributions. It derives a fully discrete, tractable gradient formulation and a sharp sensitivity bound, enabling Gaussian mechanisms to privately estimate Wasserstein gradients with controlled utility loss. The authors develop a deep-learning DP framework with clipping and privacy accounting, and demonstrate two key applications: privately training Sliced Wasserstein Autoencoders and private in-processing for fairness. Empirical results on image datasets show that the proposed approach maintains accuracy while providing strong privacy guarantees, and the framework also yields privacy-preserving generative capabilities. Overall, the method broadens private distributional learning where optimal transport distances guide the objective, with potential impact on privacy-preserving representation learning and fair ML practice.
Abstract
In this work, we introduce a novel framework for privately optimizing objectives that rely on Wasserstein distances between data-dependent empirical measures. Our main theoretical contribution is, based on an explicit formulation of the Wasserstein gradient in a fully discrete setting, a control on the sensitivity of this gradient to individual data points, allowing strong privacy guarantees at minimal utility cost. Building on these insights, we develop a deep learning approach that incorporates gradient and activations clipping, originally designed for DP training of problems with a finite-sum structure. We further demonstrate that privacy accounting methods extend to Wasserstein-based objectives, facilitating large-scale private training. Empirical results confirm that our framework effectively balances accuracy and privacy, offering a theoretically sound solution for privacy-preserving machine learning tasks relying on optimal transport distances such as Wasserstein distance or sliced-Wasserstein distance.
