Continual Learning in the Presence of Repetition
Hamed Hemati, Lorenzo Pellegrini, Xiaotian Duan, Zixuan Zhao, Fangfang Xia, Marc Masana, Benedikt Tscheschner, Eduardo Veas, Yuxiang Zheng, Shiji Zhao, Shao-Yuan Li, Sheng-Jun Huang, Vincenzo Lomonaco, Gido M. van de Ven
TL;DR
This paper investigates continual learning under repetition (CIR) through the CLVision challenge at CVPR 2023, addressing how repeated exposure of past classes within a stream impacts learning. It introduces a four-parameter data-stream generator to create CIR benchmarks and analyzes three finalist ensemble-based strategies—HAT-CIR, Horde, and DWGRNet—that exploit repetition without storing raw samples. The results show that ensemble approaches substantially outperform baselines in CIR scenarios, with HAT-CIR achieving the best overall performance, while removing repetition reduces their advantage, underscoring the pivotal role of repetition in shaping strategy effectiveness. The findings suggest that natural repetition can act as a form of regime-like bagging, enabling richer representations and robust continual learning, and point to future work on scaling ensembles and exploring larger architectures for CIR tasks.
Abstract
Continual learning (CL) provides a framework for training models in ever-evolving environments. Although re-occurrence of previously seen objects or tasks is common in real-world problems, the concept of repetition in the data stream is not often considered in standard benchmarks for CL. Unlike with the rehearsal mechanism in buffer-based strategies, where sample repetition is controlled by the strategy, repetition in the data stream naturally stems from the environment. This report provides a summary of the CLVision challenge at CVPR 2023, which focused on the topic of repetition in class-incremental learning. The report initially outlines the challenge objective and then describes three solutions proposed by finalist teams that aim to effectively exploit the repetition in the stream to learn continually. The experimental results from the challenge highlight the effectiveness of ensemble-based solutions that employ multiple versions of similar modules, each trained on different but overlapping subsets of classes. This report underscores the transformative potential of taking a different perspective in CL by employing repetition in the data stream to foster innovative strategy design.
