Continual Forgetting for Pre-trained Vision Models
Hongbo Zhao, Bolin Ni, Haochen Wang, Junsong Fan, Fei Zhu, Yuxi Wang, Yuntao Chen, Gaofeng Meng, Zhaoxiang Zhang
TL;DR
This work introduces continual forgetting for pre-trained vision models, addressing privacy-driven erasure requests that arrive sequentially. It proposes GS-LoRA, a parameter-efficient approach that inserts LoRA modules into FFN layers of Transformer blocks and uses a group sparse regularizer to automatically select which groups to modify, enabling targeted forgetting with minimal impact on retained knowledge. The method defines selective forgetting and knowledge retention losses, leveraging a small replay buffer and a sparsity-warmup strategy to balance forgetting efficacy and stability. Extensive experiments on face recognition and object detection demonstrate that GS-LoRA achieves effective forgetting with high retention of remaining knowledge, requires only a small fraction of trainable parameters, and scales to larger models, making it practical for privacy-preserving model editing.
Abstract
For privacy and security concerns, the need to erase unwanted information from pre-trained vision models is becoming evident nowadays. In real-world scenarios, erasure requests originate at any time from both users and model owners. These requests usually form a sequence. Therefore, under such a setting, selective information is expected to be continuously removed from a pre-trained model while maintaining the rest. We define this problem as continual forgetting and identify two key challenges. (i) For unwanted knowledge, efficient and effective deleting is crucial. (ii) For remaining knowledge, the impact brought by the forgetting procedure should be minimal. To address them, we propose Group Sparse LoRA (GS-LoRA). Specifically, towards (i), we use LoRA modules to fine-tune the FFN layers in Transformer blocks for each forgetting task independently, and towards (ii), a simple group sparse regularization is adopted, enabling automatic selection of specific LoRA groups and zeroing out the others. GS-LoRA is effective, parameter-efficient, data-efficient, and easy to implement. We conduct extensive experiments on face recognition, object detection and image classification and demonstrate that GS-LoRA manages to forget specific classes with minimal impact on other classes. Codes will be released on \url{https://github.com/bjzhb666/GS-LoRA}.
