Scissorhands: Scrub Data Influence via Connection Sensitivity in Networks
Jing Wu, Mehrtash Harandi
TL;DR
Scissorhands addresses the problem of erasing a forgetting data's influence from trained models under data-privacy regulations. It introduces a two-stage workflow that first trims data-influential parameters using connection sensitivity (top-$k$%) and then repairs the model with a gradient projection approach to maximize remaining-data utility while suppressing forgetting-data influence. The method demonstrates competitive forgetting-utility trade-offs across image classification and generation tasks, including a Stable Diffusion case study, supported by ablations and case analyses. This work provides a scalable, principled approach to privacy-preserving unlearning with practical implications for deployment and policy compliance.
Abstract
Machine unlearning has become a pivotal task to erase the influence of data from a trained model. It adheres to recent data regulation standards and enhances the privacy and security of machine learning applications. In this work, we present a new machine unlearning approach Scissorhands. Initially, Scissorhands identifies the most pertinent parameters in the given model relative to the forgetting data via connection sensitivity. By reinitializing the most influential top-k percent of these parameters, a trimmed model for erasing the influence of the forgetting data is obtained. Subsequently, Scissorhands fine-tunes the trimmed model with a gradient projection-based approach, seeking parameters that preserve information on the remaining data while discarding information related to the forgetting data. Our experimental results, conducted across image classification and image generation tasks, demonstrate that Scissorhands, showcases competitive performance when compared to existing methods. Source code is available at https://github.com/JingWu321/Scissorhands.
