Uncertainty quantification by block bootstrap for differentially private stochastic gradient descent
Holger Dette, Carina Graw
TL;DR
The paper tackles uncertainty quantification for SGD under local differential privacy by introducing a multiplier block bootstrap that accounts for the strong dependence in SGD iterates without requiring privacy budget splitting. It proves that, under appropriate martingale-difference and regularity conditions, the LDP-SGD estimator satisfies \\sqrt{n}(\\bar{\\theta}^{LDP}-\\theta_\\star) \\overset{d}{\\to} N(0,\\Sigma)$ with \\Sigma = G^{-1} S G^{-1}$, where $S$ aggregates both SGD and LDP noise and $G$ linearizes the gradient. The proposed bootstrap statistic \\bar{\\theta}^{\\star} mimics this limit distribution using block structure and random multipliers, achieving consistency and practical finite-sample performance with a fixed privacy budget. This yields a computationally tractable approach to UQ for LDP-SGD and, as a by-product, provides a simple UQ tool for non-private SGD. Simulations on quantile estimation and quantile regression demonstrate competitive coverage and interval lengths, underscoring the method's applicability to large-scale, privacy-preserving inference.
Abstract
Stochastic Gradient Descent (SGD) is a widely used tool in machine learning. In the context of Differential Privacy (DP), SGD has been well studied in the last years in which the focus is mainly on convergence rates and privacy guarantees. While in the non private case, uncertainty quantification (UQ) for SGD by bootstrap has been addressed by several authors, these procedures cannot be transferred to differential privacy due to multiple queries to the private data. In this paper, we propose a novel block bootstrap for SGD under local differential privacy that is computationally tractable and does not require an adjustment of the privacy budget. The method can be easily implemented and is applicable to a broad class of estimation problems. We prove the validity of our approach and illustrate its finite sample properties by means of a simulation study. As a by-product, the new method also provides a simple alternative numerical tool for UQ for non-private SGD.
