Progressive Prompts: Continual Learning for Language Models
Anastasia Razdaibiedina, Yuning Mao, Rui Hou, Madian Khabsa, Mike Lewis, Amjad Almahairi
TL;DR
Progressive Prompts addresses catastrophic forgetting in continual learning for large language models by learning a small per-task soft prompt for each task and progressively concatenating them while keeping the base model frozen. It introduces a residual MLP-based reparameterization to stabilize prompt optimization. Empirically, it outperforms state-of-the-art CL methods on BERT and T5 across standard benchmarks and long-sequence task settings, with over 20% gains on T5 in few-shot regimes and clear forward-transfer benefits. The approach is memory-efficient and model-agnostic, requiring far fewer task-specific parameters than full fine-tuning or architectural alternatives.
Abstract
We introduce Progressive Prompts - a simple and efficient approach for continual learning in language models. Our method allows forward transfer and resists catastrophic forgetting, without relying on data replay or a large number of task-specific parameters. Progressive Prompts learns a new soft prompt for each task and sequentially concatenates it with the previously learned prompts, while keeping the base model frozen. Experiments on standard continual learning benchmarks show that our approach outperforms state-of-the-art methods, with an improvement >20% in average test accuracy over the previous best-preforming method on T5 model. We also explore a more challenging continual learning setup with longer sequences of tasks and show that Progressive Prompts significantly outperforms prior methods.
