Continual Adapter Tuning with Semantic Shift Compensation for Class-Incremental Learning
Qinhao Zhou, Yuwen Tan, Boqing Gong, Xiang Xiang
TL;DR
This work tackles class-incremental learning with pre-trained Vision Transformers by showing that incrementally tuning a shared adapter outperforms prompt-based PET methods and avoiding parameter constraints enhances plasticity. It introduces a two-stage approach: (1) continuous adapter fine-tuning and local classifier updates, and (2) retraining a unified classifier using Gaussian-feature sampling and semantic-shift compensation for old prototypes without past data. The key contributions include eliminating adapter pools and data retention, achieving state-of-the-art results on multiple CIL benchmarks, and extending the approach to FSCIL and HCIL. Overall, the method offers a cost-efficient, data-sparing path to robust continual learning with strong generalization across domains.
Abstract
Class-incremental learning (CIL) aims to enable models to continuously learn new classes while overcoming catastrophic forgetting. The introduction of pre-trained models has brought new tuning paradigms to CIL. In this paper, we revisit different parameter-efficient tuning (PET) methods within the context of continual learning. We observe that adapter tuning demonstrates superiority over prompt-based methods, even without parameter expansion in each learning session. Motivated by this, we propose incrementally tuning the shared adapter without imposing parameter update constraints, enhancing the learning capacity of the backbone. Additionally, we employ feature sampling from stored prototypes to retrain a unified classifier, further improving its performance. We estimate the semantic shift of old prototypes without access to past samples and update stored prototypes session by session. Our proposed method eliminates model expansion and avoids retaining any image samples. It surpasses previous pre-trained model-based CIL methods and demonstrates remarkable continual learning capabilities. Experimental results on five CIL benchmarks validate the effectiveness of our approach, achieving state-of-the-art (SOTA) performance.
