Parameter-Efficient Augment Plugin for Class-Incremental Learning
Zhiming Xu, Baile Xu, Jian Zhao, Furao Shen, Suorong Yang
TL;DR
This paper tackles catastrophic forgetting in class-incremental learning by proposing DLC, a parameter-efficient plugin framework that injects task-specific residuals via ConvLoRA adapters into a frozen base model trained with replay and distillation. A dynamic weighting unit gates the contributions from multiple task plugins during inference, mitigating interference from non-target residuals. The two-phase training scheme keeps backbone updates decoupled from plugin learning, enabling seamless integration with non-expansion CIL baselines and delivering notable accuracy gains with modest parameter overhead (e.g., 8% on ImageNet-100 with only ~4% extra parameters). DLC also demonstrates compatibility with additional plug-and-play enhancements and under fixed memory budgets, offering a scalable, practical path toward high-performance continual learning without backbone expansion.
Abstract
Existing class-incremental learning (CIL) approaches based on replay or knowledge distillation are often constrained by forgetting or the stability-plasticity dilemma. Some expansion-based approaches could achieve higher accuracy. However, they always require significant parameter increases. In this paper, we propose a plugin extension paradigm termed the Deployment of extra LoRA Components (DLC) for non-pre-trained CIL scenarios.We treat the feature extractor trained through replay or distillation as a base model with rich knowledge. For each task, we use Low-Rank Adaptation (LoRA) to inject task-specific residuals into the base model's deep layers. During inference, representations with task-specific residuals are aggregated to produce classification predictions. To mitigate interference from non-target LoRA plugins, we introduce a lightweight weighting unit. This unit learns to assign importance scores to different LoRA-tuned representations. Like downloadable contents in software, our method serves as a plug-and-play enhancement that efficiently extends the base methods. Remarkably, on the large-scale ImageNet-100, with merely 4 % of the parameters of a standard ResNet-18, our DLC model achieves a significant 8 % improvement in accuracy, demonstrating exceptional efficiency. Moreover, it could surpass state-of-the-art methods under the fixed memory budget.
