An Efficient Replay for Class-Incremental Learning with Pre-trained Models
Weimin Yin, Bin Chen adn Chunzhao Xie, Zhenhao Tan
TL;DR
This paper tackles catastrophic forgetting in class-incremental learning when leveraging large pretrained models. It introduces Weight Balancing Replay (WBR), which uses a single memory vector per past task to guide gradient updates and balance old vs. new knowledge without maintaining large replay buffers. By extending the notion of bias from the classifier to arbitrary network weights and computing approximate bias with memory-driven activations under gradient-clip constraints, WBR achieves fast, parameter-efficient forgetting mitigation and strong performance on PTM-based benchmarks. The approach is particularly impactful for real-world deployments where memory and compute are constrained, as it preserves accuracy while dramatically reducing training cost.
Abstract
In general class-incremental learning, researchers typically use sample sets as a tool to avoid catastrophic forgetting during continuous learning. At the same time, researchers have also noted the differences between class-incremental learning and Oracle training and have attempted to make corrections. In recent years, researchers have begun to develop class-incremental learning algorithms utilizing pre-trained models, achieving significant results. This paper observes that in class-incremental learning, the steady state among the weight guided by each class center is disrupted, which is significantly correlated with catastrophic forgetting. Based on this, we propose a new method to overcoming forgetting . In some cases, by retaining only a single sample unit of each class in memory for replay and applying simple gradient constraints, very good results can be achieved. Experimental results indicate that under the condition of pre-trained models, our method can achieve competitive performance with very low computational cost and by simply using the cross-entropy loss.
