Instance-Level Difficulty: A Missing Perspective in Machine Unlearning
Hammad Rizwan, Mahtab Sarvmaili, Hassan Sajjad, Ga Wu
TL;DR
The paper shows that machine unlearning is not uniformly feasible across individual data points; by defining a ground-truth harmonic score and evaluating six candidate factors across multiple algorithms and datasets, it finds that four factors consistently relate to unlearning difficulty, while several commonly used indices fail to predict per-sample outcomes. The work demonstrates the need for instance-level evaluation and proposes that post-unlearning performance is a strong practical indicator of difficulty, urging the development of a unified predictive index. It also contrasts data-centric and model-centric perspectives to understand unlearning challenges and calls for more nuanced metrics and methods to support real-world Right-to-be-Forgotten requirements.
Abstract
Current research on deep machine unlearning primarily focuses on improving or evaluating the overall effectiveness of unlearning methods while overlooking the varying difficulty of unlearning individual training samples. As a result, the broader feasibility of machine unlearning remains under-explored. This paper studies the cruxes that make machine unlearning difficult through a thorough instance-level unlearning performance analysis over various unlearning algorithms and datasets. In particular, we summarize four factors that make unlearning a data point difficult, and we empirically show that these factors are independent of a specific unlearning algorithm but only relevant to the target model and its training data. Given these findings, we argue that machine unlearning research should pay attention to the instance-level difficulty of unlearning.
