MePo: Meta Post-Refinement for Rehearsal-Free General Continual Learnin
Guanglong Sun, Hongwei Yan, Liyuan Wang, Zhiqi Kang, Shuang Cui, Hang Su, Jun Zhu, Yi Zhong
TL;DR
This paper tackles the challenge of general continual learning (GCL) under rehearsal-free constraints by introducing Meta Post-Refinement (MePo). MePo constructs pseudo task sequences from pretraining data and employs a bi-level meta-learning framework to refine the pretrained backbone, yielding a CL-tailored initialization that adapts rapidly to downstream GCL tasks. Additionally, MePo initializes a meta covariance matrix to align second-order statistics of feature representations, stabilizing output predictions through covariance-based alignment. Empirical results across CIFAR-100, ImageNet-R, and CUB-200 show substantial improvements over strong PTMs-based baselines with minimal overhead, especially when using self-supervised pretrained checkpoints, demonstrating the method’s effectiveness and efficiency in rehearsal-free GCL settings.
Abstract
To cope with uncertain changes of the external world, intelligent systems must continually learn from complex, evolving environments and respond in real time. This ability, collectively known as general continual learning (GCL), encapsulates practical challenges such as online datastreams and blurry task boundaries. Although leveraging pretrained models (PTMs) has greatly advanced conventional continual learning (CL), these methods remain limited in reconciling the diverse and temporally mixed information along a single pass, resulting in sub-optimal GCL performance. Inspired by meta-plasticity and reconstructive memory in neuroscience, we introduce here an innovative approach named Meta Post-Refinement (MePo) for PTMs-based GCL. This approach constructs pseudo task sequences from pretraining data and develops a bi-level meta-learning paradigm to refine the pretrained backbone, which serves as a prolonged pretraining phase but greatly facilitates rapid adaptation of representation learning to downstream GCL tasks. MePo further initializes a meta covariance matrix as the reference geometry of pretrained representation space, enabling GCL to exploit second-order statistics for robust output alignment. MePo serves as a plug-in strategy that achieves significant performance gains across a variety of GCL benchmarks and pretrained checkpoints in a rehearsal-free manner (e.g., 15.10\%, 13.36\%, and 12.56\% on CIFAR-100, ImageNet-R, and CUB-200 under Sup-21/1K). Our source code is available at \href{https://github.com/SunGL001/MePo}{MePo}
