AnaCP: Toward Upper-Bound Continual Learning via Analytic Contrastive Projection
Saleh Momeni, Changnan Xiao, Bing Liu
TL;DR
AnaCP tackles class-incremental learning by enabling analytic, gradient-free feature adaptation of fixed PTM features through a contrastive projection layer. It combines positive alignment via prototype regression with negative repulsion via target-prototype separation, followed by an analytic classifier and pseudo-replay with a shared covariance to maintain CF-resilience. The approach yields accuracies close to or surpassing joint training on several benchmarks, with strong memory-time efficiency despite a larger parameter footprint, and remains robust to catastrophic forgetting. Overall, AnaCP presents a scalable, CF-free pathway to leverage powerful PTMs for continual learning, with room to improve when weaker PTMs are used and potential extension to task- or domain-incremental settings.
Abstract
This paper studies the problem of class-incremental learning (CIL), a core setting within continual learning where a model learns a sequence of tasks, each containing a distinct set of classes. Traditional CIL methods, which do not leverage pre-trained models (PTMs), suffer from catastrophic forgetting (CF) due to the need to incrementally learn both feature representations and the classifier. The integration of PTMs into CIL has recently led to efficient approaches that treat the PTM as a fixed feature extractor combined with analytic classifiers, achieving state-of-the-art performance. However, they still face a major limitation: the inability to continually adapt feature representations to best suit the CIL tasks, leading to suboptimal performance. To address this, we propose AnaCP (Analytic Contrastive Projection), a novel method that preserves the efficiency of analytic classifiers while enabling incremental feature adaptation without gradient-based training, thereby eliminating the CF caused by gradient updates. Our experiments show that AnaCP not only outperforms existing baselines but also achieves the accuracy level of joint training, which is regarded as the upper bound of CIL.
