Soft Weighted Machine Unlearning
Xinbao Qiao, Ningning Ding, Yushi Cheng, Meng Zhang
TL;DR
The paper tackles over-unlearning in non-privacy machine unlearning by shifting from binary data removal to a soft-weighted, influence-based approach. It introduces a weighted influence function with per-sample weights $\epsilon_j$ and solves a convex quadratic program to discover optimal weights, enabling fine-grained model corrections that maintain utility while improving fairness and robustness. The framework is designed to be compatible with a wide range of unlearning methods and shows consistent improvements across multiple fairness and robustness tasks, including scenarios with large models, while adding negligible overhead. This work provides a practical and theoretically motivated path to mitigate over-unlearning and enhance the applicability of unlearning to real-world non-privacy objectives.
Abstract
Machine unlearning, as a post-hoc processing technique, has gained widespread adoption in addressing challenges like bias mitigation and robustness enhancement, colloquially, machine unlearning for fairness and robustness. However, existing non-privacy unlearning-based solutions persist in using binary data removal framework designed for privacy-driven motivation, leading to significant information loss, a phenomenon known as over-unlearning. While over-unlearning has been largely described in many studies as primarily causing utility degradation, we investigate its fundamental causes and provide deeper insights in this work through counterfactual leave-one-out analysis. In this paper, we introduce a weighted influence function that assigns tailored weights to each sample by solving a convex quadratic programming problem analytically. Building on this, we propose a soft-weighted framework enabling fine-grained model adjustments to address the over-unlearning challenge. We demonstrate that the proposed soft-weighted scheme is versatile and can be seamlessly integrated into most existing unlearning algorithms. Extensive experiments show that in fairness- and robustness-driven tasks, the soft-weighted scheme significantly outperforms hard-weighted schemes in fairness/robustness metrics and alleviates the decline in utility metric, thereby enhancing machine unlearning algorithm as an effective correction solution.
