Hierarchical Prompts for Rehearsal-free Continual Learning
Yukun Zuo, Hantao Yao, Lu Yu, Liansheng Zhuang, Changsheng Xu
TL;DR
This paper tackles catastrophic forgetting in rehearsal-free continual learning by introducing Hierarchical Prompts (H-Prompts), a three-tier prompt design: class prompts preserve the distributions of past classes, task prompts fuse past and current task knowledge, and general prompts learn generalized representations. The method uses Bayesian Distribution Alignment to model class distributions, Cross-task Knowledge Excavation to transfer past knowledge into the current task prompt, and Generalized Knowledge Exploration to obtain robust self-supervised features, all while keeping the backbone frozen. The total objective combines these components, and at test time an inference strategy uses a task-aware query-key mechanism to identify the appropriate task prompts. On Split CIFAR-100 and Split ImageNet-R, the method reports state-of-the-art results, with average accuracies of 87.8% and 70.6% respectively and low forgetting; ablations validate the contribution of each prompt type and of the inference strategy. The work indicates that structured hierarchical prompts can substantially improve rehearsal-free continual learning and generalization, with potential extensions to broader vision tasks.
Abstract
Continual learning aims to equip a model with the ability to integrate knowledge from the current task while mitigating the forgetting of knowledge from past tasks. Inspired by prompt tuning, prompt-based methods keep the backbone frozen and train only a small number of learnable prompts, which reduces the catastrophic forgetting caused by updating a large number of backbone parameters. Nonetheless, these learnable prompts tend to concentrate on the discriminative knowledge of the current task while ignoring past task knowledge, so the prompts themselves still suffer from catastrophic forgetting. This paper introduces a novel rehearsal-free paradigm for continual learning termed Hierarchical Prompts (H-Prompts), comprising three categories of prompts: class prompt, task prompt, and general prompt. To effectively depict the knowledge of past classes, the class prompt leverages Bayesian Distribution Alignment to model the distribution of classes in each task. To reduce the forgetting of past task knowledge, the task prompt employs Cross-task Knowledge Excavation to combine the knowledge encapsulated in the learned class prompts of past tasks with current task knowledge. Furthermore, the general prompt utilizes Generalized Knowledge Exploration to derive highly generalized knowledge in a self-supervised manner. Evaluations on two benchmarks substantiate the efficacy of the proposed H-Prompts, with an average accuracy of 87.8% on Split CIFAR-100 and 70.6% on Split ImageNet-R.
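
The TL;DR above mentions a task-aware query-key mechanism for choosing task prompts at test time. The summary does not give its details, so the following is only a minimal sketch of a generic query-key lookup, not the authors' implementation. It assumes that each task stores one key vector, that a query feature comes from the frozen backbone, and that the match is scored by cosine similarity; the names `cosine` and `select_task_prompt` and the toy vectors are hypothetical.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


def select_task_prompt(query, task_keys, task_prompts):
    """Pick the task whose key is most similar to the query feature.

    Assumption: one key and one prompt per task; the paper's actual
    scoring rule may differ.
    """
    sims = [cosine(query, key) for key in task_keys]
    best = max(range(len(sims)), key=sims.__getitem__)
    return best, task_prompts[best]


# Toy example: three tasks with hand-picked keys (illustrative values only).
task_keys = [[1.0, 0.0, 0.0], [0.0, 1.0, 0.0], [0.0, 0.0, 1.0]]
task_prompts = ["prompt_task_0", "prompt_task_1", "prompt_task_2"]
query = [0.1, 0.9, 0.2]  # stand-in for a frozen-backbone feature

task_id, prompt = select_task_prompt(query, task_keys, task_prompts)
print(task_id, prompt)
```

In this toy setup the query lies closest to the second key, so the second task's prompt is selected. In a real system the keys would be learned during training and the prompts would be tensors prepended to the backbone input.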
