Learning with Locally Private Examples by Inverse Weierstrass Private Stochastic Gradient Descent
Jean Dufraiche, Paul Mangold, Michaël Perrot, Marc Tommasi
TL;DR
This work addresses bias that arises when learning from locally differentially private, one-shot data releases. It recasts privacy noise as functional transforms, the Gaussian mechanism as a Weierstrass transform and the binary Randomized Response as a Bernoulli transform, and derives their inverses to obtain unbiased loss and gradient estimators. The authors introduce Inverse Weierstrass Private SGD (IWP-SGD), provide unbiasedness proofs and convergence guarantees with an $O(1/n)$ rate, and show how the method specializes to generalized linear models. Empirical results on synthetic and real binary classification tasks demonstrate that IWP-SGD removes LDP bias present in standard SGD on noisy data, while incurring a variance penalty. Overall, the paper offers a principled, task-agnostic framework for debiasing private data and enables effective reuse of privatized data across downstream tasks.
Abstract
Releasing data once and for all under noninteractive Local Differential Privacy (LDP) enables complete data reusability, but the resulting noise may create bias in subsequent analyses. In this work, we leverage the Weierstrass transform to characterize this bias in binary classification. We prove that inverting this transform leads to a bias-correction method to compute unbiased estimates of nonlinear functions on examples released under LDP. We then build a novel stochastic gradient descent algorithm called Inverse Weierstrass Private SGD (IWP-SGD). It converges to the true population risk minimizer at a rate of $\mathcal{O}(1/n)$, with $n$ the number of examples. We empirically validate IWP-SGD on binary classification tasks using synthetic and real-world datasets.
