Teaching Prompts to Coordinate: Hierarchical Layer-Grouped Prompt Tuning for Continual Learning

Shengqin Jiang; Tianqi Kong; Yuankai Qi; Haokui Zhang; Lina Yao; Quan Z. Sheng; Qingshan Liu; Ming-Hsuan Yang

Teaching Prompts to Coordinate: Hierarchical Layer-Grouped Prompt Tuning for Continual Learning

Shengqin Jiang, Tianqi Kong, Yuankai Qi, Haokui Zhang, Lina Yao, Quan Z. Sheng, Qingshan Liu, Ming-Hsuan Yang

TL;DR

This work tackles catastrophic forgetting in continual learning by introducing hierarchical layer-grouped prompt tuning (HLGP). HLGP replaces independent per-layer prompts with shared implicit prompts generated for layer groups from a single task-specific root prompt $p_t$, using intermediate adapters to map to group prompts and adding a position incentive embedding (PIE) to preserve layer order. A soft task matching (SoTM) mechanism weights sub-prompts based on inference dynamics, enabling effective fusion without explicit task IDs. Across CIFAR-100, IN-R, IN-A, and VTAB, HLGP with PIE and SoTM delivers state-of-the-art final and continual accuracy while reducing the number of trainable parameters, demonstrating robustness and practical utility for prompt-based continual learning.

Abstract

Prompt-based continual learning methods fine-tune only a small set of additional learnable parameters while keeping the pre-trained model's parameters frozen. It enables efficient adaptation to new tasks while mitigating the risk of catastrophic forgetting. These methods typically attach one independent task-specific prompt to each layer of pre-trained models to locally modulate its features, ensuring that the layer's representation aligns with the requirements of the new task. However, although introducing learnable prompts independently at each layer provides high flexibility for adapting to new tasks, this overly flexible tuning could make certain layers susceptible to unnecessary updates. As all prompts till the current task are added together as a final prompt for all seen tasks, the model may easily overwrite feature representations essential to previous tasks, which increases the risk of catastrophic forgetting. To address this issue, we propose a novel hierarchical layer-grouped prompt tuning method for continual learning. It improves model stability in two ways: (i) Layers in the same group share roughly the same prompts, which are adjusted by position encoding. This helps preserve the intrinsic feature relationships and propagation pathways of the pre-trained model within each group. (ii) It utilizes a single task-specific root prompt to learn to generate sub-prompts for each layer group. In this way, all sub-prompts are conditioned on the same root prompt, enhancing their synergy and reducing independence. Extensive experiments across four benchmarks demonstrate that our method achieves favorable performance compared with several state-of-the-art methods.

Teaching Prompts to Coordinate: Hierarchical Layer-Grouped Prompt Tuning for Continual Learning

TL;DR

Abstract

Teaching Prompts to Coordinate: Hierarchical Layer-Grouped Prompt Tuning for Continual Learning

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (5)