Certified Machine Unlearning via Noisy Stochastic Gradient Descent
Eli Chien, Haoyu Wang, Ziang Chen, Pan Li
TL;DR
This work introduces certified machine unlearning via projected noisy SGD (PNSGD) for convex objectives, establishing a first approximate unlearning guarantee by linking the initial $W_\infty$ distance between adjacent learning processes to the final Rényi divergence through contractive noisy iterations. The method yields explicit RU guarantees with bounds that depend on mini-batch size, step size, and data geometry, and demonstrates substantial computational savings over retraining while preserving utility under privacy constraints. The analysis scales to sequential and batch unlearning via $W_\infty$ tracking and triangle inequalities, and is complemented by experiments on MNIST and CIFAR-10 showing strong practicality with only a small fraction of gradient computations compared to baselines. Limitations include the strong convexity requirement, with future work potentially extending to non-convex settings using Langevin dynamics and broader unlearning scenarios.
Abstract
``The right to be forgotten'' ensured by laws for user data privacy becomes increasingly important. Machine unlearning aims to efficiently remove the effect of certain data points on the trained model parameters so that it can be approximately the same as if one retrains the model from scratch. We propose to leverage projected noisy stochastic gradient descent for unlearning and establish its first approximate unlearning guarantee under the convexity assumption. Our approach exhibits several benefits, including provable complexity saving compared to retraining, and supporting sequential and batch unlearning. Both of these benefits are closely related to our new results on the infinite Wasserstein distance tracking of the adjacent (un)learning processes. Extensive experiments show that our approach achieves a similar utility under the same privacy constraint while using $2\%$ and $10\%$ of the gradient computations compared with the state-of-the-art gradient-based approximate unlearning methods for mini-batch and full-batch settings, respectively.
