Forget Vectors at Play: Universal Input Perturbations Driving Machine Unlearning in Image Classification
Changchang Sun, Ren Wang, Yihua Zhang, Jinghan Jia, Jiancheng Liu, Gaowen Liu, Yan Yan, Sijia Liu
TL;DR
This work tackles machine unlearning (MU) for image classification by introducing forget vectors, a universal input perturbation that enables unlearning without altering model weights. By formulating a data-based MU objective that combines a forget loss with retain data regularization, the approach demonstrates competitive unlearning effectiveness (UA) and strong MIA-Efficacy relative to model-based methods, while incurring trade-offs in utility (RA, TA). A key innovation is compositional unlearning via forget vector arithmetic, enabling transfer of class-wise forget vectors to generate new forget vectors for unseen forgetting tasks. Extensive experiments on CIFAR-10 and ImageNet-10 validate the method’s effectiveness, transferability, and interpretability through saliency analyses, with practical advantages in storage and parameter efficiency. The work presents a promising, scalable alternative to retraining-based MU, with potential impact on privacy-compliant data removal and rapid model editing in vision systems.
Abstract
Machine unlearning (MU), which seeks to erase the influence of specific unwanted data from already-trained models, is becoming increasingly vital in model editing, particularly to comply with evolving data regulations like the ``right to be forgotten''. Conventional approaches are predominantly model-based, typically requiring retraining or fine-tuning the model's weights to meet unlearning requirements. In this work, we approach the MU problem from a novel input perturbation-based perspective, where the model weights remain intact throughout the unlearning process. We demonstrate the existence of a proactive input-based unlearning strategy, referred to forget vector, which can be generated as an input-agnostic data perturbation and remains as effective as model-based approximate unlearning approaches. We also explore forget vector arithmetic, whereby multiple class-specific forget vectors are combined through simple operations (e.g., linear combinations) to generate new forget vectors for unseen unlearning tasks, such as forgetting arbitrary subsets across classes. Extensive experiments validate the effectiveness and adaptability of the forget vector, showcasing its competitive performance relative to state-of-the-art model-based methods. Codes are available at https://github.com/Changchangsun/Forget-Vector.
