A Unified Gradient-based Framework for Task-agnostic Continual Learning-Unlearning
Zhehao Huang, Xinwen Cheng, Jie Zhang, Jinghao Zheng, Haoran Wang, Zhengbao He, Tao Li, Xiaolin Huang
TL;DR
This work addresses the need for systems that learn continually while unlearning specific data; it proposes a unified gradient-based continual learning-unlearning framework built on $D_{KL}$ minimization that aligns learning, unlearning, and retention under a remain-preserved manifold. The approach decomposes the gradient into four components—learning, unlearning, remaining-knowledge preservation, and a weight saliency modulation—coupled with an implicit online Hessian via a fast-slow update and adaptive sample weighting to balance plasticity and stability. Key contributions include a four-term gradient decomposition, the remain-preserved Hessian constraint, an efficient Hessian-approximate update mechanism, a balanced weight saliency mask, and a task-agnostic CLU paradigm with cross-task and random-sample unlearning benchmarks. Empirically, UG-CLU coordinates incremental learning, precise unlearning, and knowledge stability across CIFAR-10 and TinyImageNet with diverse architectures, outperforming task-aware baselines and ablation studies confirming the utility of each component. The work provides a theoretical foundation and practical framework for dynamic, privacy-aware lifelong learning systems with fine-grained unlearning capabilities.
Abstract
Recent advancements in deep models have highlighted the need for intelligent systems that combine continual learning (CL) for knowledge acquisition with machine unlearning (MU) for data removal, forming the Continual Learning-Unlearning (CLU) paradigm. While existing work treats CL and MU as separate processes, we reveal their intrinsic connection through a unified optimization framework based on Kullback-Leibler divergence minimization. This framework decomposes gradient updates for approximate CLU into four components: learning new knowledge, unlearning targeted data, preserving existing knowledge, and modulation via weight saliency. A critical challenge lies in balancing knowledge update and retention during sequential learning-unlearning cycles. To resolve this stability-plasticity dilemma, we introduce a remain-preserved manifold constraint to induce a remaining Hessian compensation for CLU iterations. A fast-slow weight adaptation mechanism is designed to efficiently approximate the second-order optimization direction, combined with adaptive weighting coefficients and a balanced weight saliency mask, proposing a unified implementation framework for gradient-based CLU. Furthermore, we pioneer task-agnostic CLU scenarios that support fine-grained unlearning at the cross-task category and random sample levels beyond the traditional task-aware setups. Experiments demonstrate that the proposed UG-CLU framework effectively coordinates incremental learning, precise unlearning, and knowledge stability across multiple datasets and model architectures, providing a theoretical foundation and methodological support for dynamic, compliant intelligent systems.
