Prompt Customization for Continual Learning
Yong Dai, Xiaopeng Hong, Yabin Wang, Zhiheng Ma, Dongmei Jiang, Yaowei Wang
TL;DR
This work tackles prompt-based continual learning by addressing the instability of hard prompt selection as tasks accumulate. It introduces Prompt Customization (PC), which combines a Prompt Generation Module (PGM) and a Prompt Modulation Module (PMM) to generate and adapt instance-specific prompts from a fixed codebook, removing the need for task-wise prompt selection. The generated prompts are integrated into a frozen Vision Transformer backbone, with a momentum-updated codebook and regularization to preserve old knowledge, achieving up to 16.2% gains in average accuracy across class, domain, and task-agnostic settings. The approach demonstrates strong scalability and robustness across four datasets, suggesting practical impact for continual learning in dynamic environments and motivating future exploration of multimodal and codebook-free prompt strategies.
Abstract
Contemporary continual learning approaches typically select prompts from a pool, which function as supplementary inputs to a pre-trained model. However, this strategy is hindered by the inherent noise of its selection approach when handling increasing tasks. In response to these challenges, we reformulate the prompting approach for continual learning and propose the prompt customization (PC) method. PC mainly comprises a prompt generation module (PGM) and a prompt modulation module (PMM). In contrast to conventional methods that employ hard prompt selection, PGM assigns different coefficients to prompts from a fixed-sized pool of prompts and generates tailored prompts. Moreover, PMM further modulates the prompts by adaptively assigning weights according to the correlations between input data and corresponding prompts. We evaluate our method on four benchmark datasets for three diverse settings, including the class, domain, and task-agnostic incremental learning tasks. Experimental results demonstrate consistent improvement (by up to 16.2\%), yielded by the proposed method, over the state-of-the-art (SOTA) techniques.
