Is Retain Set All You Need in Machine Unlearning? Restoring Performance of Unlearned Models with Out-Of-Distribution Images
Jacopo Bonato, Marco Cotogni, Luigi Sabetta
TL;DR
The paper tackles privacy-preserving unlearning without access to a retain set and introduces SCAR, a model-agnostic method that combines metric learning with a distillation-trick to erase forget-set information while preserving test performance. It leverages the Mahalanobis distance $d_M$ to relocate forget samples to the nearest non-forget class distribution and uses a surrogate out-of-distribution dataset with Jensen-Shannon divergence $d_{JS}$ to transfer knowledge from the original model to the unlearning model. The authors also propose SCAR Self-forget, enabling class-removal without forget data, and demonstrate competitive performance across CR and HR settings on CIFAR and TinyImagenet, with architectural-agnostic results and thorough ablations. Overall, SCAR offers a retain-set-free, architecture-agnostic approach to approximate unlearning with practical implications for privacy and data rights, while outlining limitations and directions for certifiability research.
Abstract
In this paper, we introduce Selective-distillation for Class and Architecture-agnostic unleaRning (SCAR), a novel approximate unlearning method. SCAR efficiently eliminates specific information while preserving the model's test accuracy without using a retain set, which is a key component in state-of-the-art approximate unlearning algorithms. Our approach utilizes a modified Mahalanobis distance to guide the unlearning of the feature vectors of the instances to be forgotten, aligning them to the nearest wrong class distribution. Moreover, we propose a distillation-trick mechanism that distills the knowledge of the original model into the unlearning model with out-of-distribution images for retaining the original model's test performance without using any retain set. Importantly, we propose a self-forget version of SCAR that unlearns without having access to the forget set. We experimentally verified the effectiveness of our method, on three public datasets, comparing it with state-of-the-art methods. Our method obtains performance higher than methods that operate without the retain set and comparable w.r.t the best methods that rely on the retain set.
