CP-Prompt: Composition-Based Cross-modal Prompting for Domain-Incremental Continual Learning
Yu Feng, Zhen Tian, Yifan Zhu, Zongfu Han, Haoran Luo, Guangwei Zhang, Meina Song
TL;DR
CP-Prompt introduces a parameter-efficient twin-prompting framework for cross-modal domain-incremental continual learning by combining inter-domain common prompts with intra-domain personalized prompts. Common prompts are learned across domains and frozen progressively, while Prefix-One personalized prompts are injected into multi-head self-attention to encode domain-specific semantics, with extensions to the text encoder. The approach yields superior performance on CDDB-Hard, CORe50, and DomainNet with minimal parameter updates (approximately 0.22% of parameters) and reduced forgetting. Empirical analyses, including ablations and attention visualizations, demonstrate that CP-Prompt effectively preserves cross-domain knowledge while capturing domain-specific nuances, outperforming state-of-the-art exemplar-free baselines and competitive prompting methods.
Abstract
The key challenge of cross-modal domain-incremental learning (DIL) is to enable the learning model to continuously learn from novel data with different feature distributions under the same task without forgetting old ones. However, existing top-performing methods still cause high forgetting rates, by lacking intra-domain knowledge extraction and inter-domain common prompting strategy. In this paper, we propose a simple yet effective framework, CP-Prompt, by training limited parameters to instruct a pre-trained model to learn new domains and avoid forgetting existing feature distributions. CP-Prompt captures intra-domain knowledge by compositionally inserting personalized prompts on multi-head self-attention layers and then learns the inter-domain knowledge with a common prompting strategy. CP-Prompt shows superiority compared with state-of-the-art baselines among three widely evaluated DIL tasks. The source code is available at https://github.com/dannis97500/CP_Prompt.
