Towards Unbounded Machine Unlearning
Meghdad Kurmanji, Peter Triantafillou, Jamie Hayes, Eleni Triantafillou
TL;DR
This work tackles deep machine unlearning by proposing SCRUB, a scalable teacher-student framework that selectively forgets data while preserving retained knowledge. SCRUB optimizes to maximize forgetting on the forget set and minimize impact on retained data, with a practical rewinding variant (SCRUB+R) to balance privacy and utility. It unifies three applications—removing biases, resolving label confusion, and user privacy—by evaluating across diverse metrics and datasets, and it introduces a LiRA-adapted MIA to assess privacy defenses. Empirically, SCRUB consistently outperforms baselines in forgetting quality and utility, offering a practical, scalable solution with strong defense against membership inference attacks, while acknowledging the need for formal guarantees and broader scalability in future work.
Abstract
Deep machine unlearning is the problem of `removing' from a trained neural network a subset of its training set. This problem is very timely and has many applications, including the key tasks of removing biases (RB), resolving confusion (RC) (caused by mislabelled data in trained models), as well as allowing users to exercise their `right to be forgotten' to protect User Privacy (UP). This paper is the first, to our knowledge, to study unlearning for different applications (RB, RC, UP), with the view that each has its own desiderata, definitions for `forgetting' and associated metrics for forget quality. For UP, we propose a novel adaptation of a strong Membership Inference Attack for unlearning. We also propose SCRUB, a novel unlearning algorithm, which is the only method that is consistently a top performer for forget quality across the different application-dependent metrics for RB, RC, and UP. At the same time, SCRUB is also consistently a top performer on metrics that measure model utility (i.e. accuracy on retained data and generalization), and is more efficient than previous work. The above are substantiated through a comprehensive empirical evaluation against previous state-of-the-art.
