DualOptim: Enhancing Efficacy and Stability in Machine Unlearning with Dual Optimizers
Xuyang Zhong, Haochen Luo, Chen Liu
TL;DR
The paper tackles the instability and hyperparameter sensitivity of approximate machine unlearning by introducing DualOptim, a plug-and-play framework that uses adaptive learning rates for forgetting and decoupled momentum for forgetting and retaining objectives. The approach is supported by theoretical guarantees showing reduced parameter-variance with decoupled momentum and by extensive experiments across image classification, image generation, and large language models, where it improves forgetting efficacy and stability relative to strong baselines. Key contributions include the formalization of a two-objective MU problem, the demonstration that decoupled momentum reduces worst-case variance, and the empirical validation of a practical, generalizable optimization scheme that pushes state-of-the-art in MU. The work has practical impact for deploying trustworthy MU systems in real-world settings, offering a robust, modular method that can be integrated with existing MU algorithms. $\min_{\theta} \mathcal{L}_f(\mathcal{D}_f, \theta) + \mathcal{L}_r(\mathcal{D}_r, \theta)$, adaptive learning rates, and decoupled momentum together enable more stable and effective forgetting across diverse tasks.
Abstract
Existing machine unlearning (MU) approaches exhibit significant sensitivity to hyperparameters, requiring meticulous tuning that limits practical deployment. In this work, we first empirically demonstrate the instability and suboptimal performance of existing popular MU methods when deployed in different scenarios. To address this issue, we propose Dual Optimizer (DualOptim), which incorporates adaptive learning rate and decoupled momentum factors. Empirical and theoretical evidence demonstrates that DualOptim contributes to effective and stable unlearning. Through extensive experiments, we show that DualOptim can significantly boost MU efficacy and stability across diverse tasks, including image classification, image generation, and large language models, making it a versatile approach to empower existing MU algorithms.
