Improving Plasticity in Online Continual Learning via Collaborative Learning
Maorong Wang, Nicolas Michel, Ling Xiao, Toshihiko Yamasaki
TL;DR
This work reframes online Continual Learning by foregrounding model plasticity alongside stability. It introduces Collaborative Continual Learning with Distillation Chain (CCL-DC), a two-peer collaborative framework augmented by a Distillation Chain to regulate prediction entropy, designed to be compatible with existing online CL methods. By defining Learning Accuracy ($LA$) and Relative Forgetting ($RF$) and deriving an approximate relation $AA \gtrapprox LA \times (1 - RF)$, the paper quantifies how plasticity and stability jointly determine final performance. Across four image-classification datasets and multiple baselines, CCL-DC consistently boosts plasticity and final accuracy, with substantial gains that persist across memory budgets, and it offers extensive ablations, visualizations, and implementation details. The approach demonstrates a scalable, effective pathway to improve online CL by leveraging collaborative learning and entropy-regularized distillation.
Abstract
Online Continual Learning (CL) solves the problem of learning the ever-emerging new classification tasks from a continuous data stream. Unlike its offline counterpart, in online CL, the training data can only be seen once. Most existing online CL research regards catastrophic forgetting (i.e., model stability) as almost the only challenge. In this paper, we argue that the model's capability to acquire new knowledge (i.e., model plasticity) is another challenge in online CL. While replay-based strategies have been shown to be effective in alleviating catastrophic forgetting, there is a notable gap in research attention toward improving model plasticity. To this end, we propose Collaborative Continual Learning (CCL), a collaborative learning based strategy to improve the model's capability in acquiring new concepts. Additionally, we introduce Distillation Chain (DC), a collaborative learning scheme to boost the training of the models. We adapt CCL-DC to existing representative online CL works. Extensive experiments demonstrate that even if the learners are well-trained with state-of-the-art online CL methods, our strategy can still improve model plasticity dramatically, and thereby improve the overall performance by a large margin. The source code of our work is available at https://github.com/maorong-wang/CCL-DC.
