Learn to Forget: Machine Unlearning via Neuron Masking
Yang Liu, Zhuo Ma, Ximeng Liu, Jian Liu, Zhongyuan Jiang, Jianfeng Ma, Philip Yu, Kui Ren
TL;DR
This paper tackles the right-to-forget challenge in neural networks by introducing Forsaken, a neuron-masking unlearning method that uses a mask gradient generator to erase memorization of chosen data. It also defines forgetting rate (FR), a uniform metric based on a membership oracle, to quantify unlearning effectiveness. Forsaken demonstrates strong forgetting performance (average FR > 90%) with minimal utility loss (<5% accuracy drop) across eight diverse datasets, and offers practical efficiency advantages over retraining-based approaches. The approach is shown to generalize beyond neural networks to LR and SVM, and is adaptable to federated learning contexts, enabling scalable, data-subset forgetting with reduced privacy risks from unintended memorization.
Abstract
Nowadays, machine learning models, especially neural networks, become prevalent in many real-world applications.These models are trained based on a one-way trip from user data: as long as users contribute their data, there is no way to withdraw; and it is well-known that a neural network memorizes its training data. This contradicts the "right to be forgotten" clause of GDPR, potentially leading to law violations. To this end, machine unlearning becomes a popular research topic, which allows users to eliminate memorization of their private data from a trained machine learning model.In this paper, we propose the first uniform metric called for-getting rate to measure the effectiveness of a machine unlearning method. It is based on the concept of membership inference and describes the transformation rate of the eliminated data from "memorized" to "unknown" after conducting unlearning. We also propose a novel unlearning method calledForsaken. It is superior to previous work in either utility or efficiency (when achieving the same forgetting rate). We benchmark Forsaken with eight standard datasets to evaluate its performance. The experimental results show that it can achieve more than 90\% forgetting rate on average and only causeless than 5\% accuracy loss.
