Label Smoothing Improves Machine Unlearning
Zonglin Di, Zhaowei Zhu, Jinghan Jia, Jiancheng Liu, Zafar Takhirov, Bo Jiang, Yuanshun Yao, Sijia Liu, Yang Liu
TL;DR
MU aims to remove the influence of forgetting data with lower cost than retraining. The authors introduce UGradSL, a plug‑and‑play gradient-based unlearning method that couples gradient ascent on the forgetting data with gradient descent on retained data, augmented by generalized label smoothing (GLS) with a negative smoothing rate $\alpha<0$. They provide theoretical results showing when GA helps MU and how GLS enhances MU, including privacy considerations in the form of $\epsilon$-Label-LDP. Empirically, across six datasets and multiple forgetting paradigms, UGradSL achieves robust unlearning gains with modest additional computation, e.g., up to a $66\%$ improvement in unlearning accuracy over GA baselines, demonstrating a scalable approach to MU that balances forgetting strength and data utility.
Abstract
The objective of machine unlearning (MU) is to eliminate previously learned data from a model. However, it is challenging to strike a balance between computation cost and performance when using existing MU techniques. Taking inspiration from the influence of label smoothing on model confidence and differential privacy, we propose a simple gradient-based MU approach that uses an inverse process of label smoothing. This work introduces UGradSL, a simple, plug-and-play MU approach that uses smoothed labels. We provide theoretical analyses demonstrating why properly introducing label smoothing improves MU performance. We conducted extensive experiments on six datasets of various sizes and different modalities, demonstrating the effectiveness and robustness of our proposed method. The consistent improvement in MU performance is only at a marginal cost of additional computations. For instance, UGradSL improves over the gradient ascent MU baseline by 66% unlearning accuracy without sacrificing unlearning efficiency.
