Cut out and Replay: A Simple yet Versatile Strategy for Multi-Label Online Continual Learning

Xinrui Wang; Shao-yuan Li; Jiaqiang Zhang; Songcan Chen

TL;DR

Multi-Label Online Continual Learning (MOCL) must cope with pervasive missing labels and long-tailed class distributions in streaming multi-label data. The authors propose CUTER, a simple plug-in strategy that identifies, strengthens, and replays label-specific regions to support fine-grained experience replay. It comprises three components: (1) a zero-shot assessment of the localization ability of pre-trained models, using the average Fiedler value $\lambda_2$ of patch graphs; (2) selective replay via label-region matching, with MCut extracting object crops for memory; and (3) a localization-aware regularizer based on the nuclear norm $\|A\|_*$ that stabilizes the patch graphs. Experiments on VOC 2007, MS-COCO, and NUS-WIDE show state-of-the-art MOCL performance and confirm the method's plug-in compatibility with existing approaches. The work grounds the method in spectral graph theory and discusses the trade-offs in computational overhead and backbone choice.
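The spectral quantities named in this summary can be made concrete with a small NumPy sketch. It is an illustration under stated assumptions, not the authors' implementation: the cosine-similarity patch graph, the threshold `tau`, the use of the unnormalized Laplacian, and the helper names `patch_affinity`, `fiedler`, and `nuclear_norm` are all hypothetical choices, and the sign split of the Fiedler vector is a generic spectral bipartition standing in for MCut, whose exact procedure is not described here.

```python
import numpy as np

def patch_affinity(features, tau=0.0):
    """Cosine-similarity affinity between patch features (assumed construction)."""
    normed = features / np.linalg.norm(features, axis=1, keepdims=True)
    A = normed @ normed.T
    A = np.where(A > tau, A, 0.0)      # keep only similarities above tau
    np.fill_diagonal(A, 0.0)           # no self-loops
    return A

def fiedler(A):
    """Second-smallest eigenvalue and eigenvector of the Laplacian L = D - A."""
    L = np.diag(A.sum(axis=1)) - A
    eigvals, eigvecs = np.linalg.eigh(L)   # eigenvalues in ascending order
    return eigvals[1], eigvecs[:, 1]

def nuclear_norm(A):
    """Nuclear norm ||A||_*: the sum of the singular values of A."""
    return np.linalg.norm(A, ord="nuc")

# Toy "image" whose six patches fall into two groups (object vs. background).
two_region = np.array([[1.0, 0.1], [1.0, 0.0], [0.9, 0.1],
                       [0.1, 1.0], [0.0, 1.0], [0.1, 0.9]])
# Toy "image" whose six patches all look alike (no separable region).
one_region = np.array([[1.0, 0.1], [1.0, 0.0], [0.9, 0.1],
                       [1.0, 0.05], [0.95, 0.0], [0.9, 0.05]])

A_two = patch_affinity(two_region)
A_one = patch_affinity(one_region)

lam_two, vec_two = fiedler(A_two)
lam_one, _ = fiedler(A_one)

# Sign of the Fiedler vector gives a two-way partition of the patches.
mask = vec_two > 0

# Averaging lambda_2 over images gives a single score per backbone.
avg_lambda2 = np.mean([lam_two, lam_one])

print("lambda_2 (two regions):", lam_two)
print("lambda_2 (one region): ", lam_one)
print("partition mask:        ", mask)
print("nuclear norm of A_two: ", nuclear_norm(A_two))
```

A smaller $\lambda_2$ means the patch graph is closer to being disconnected, so the two-group toy image yields a much lower value than the uniform one. How the paper maps this score onto a ranking of backbones, and how it weights the nuclear-norm term during training, is not specified in this summary.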

Abstract

Multi-Label Online Continual Learning (MOCL) requires models to learn continuously from endless multi-label data streams, facing complex challenges including persistent catastrophic forgetting, potential missing labels, and uncontrollable imbalanced class distributions. While existing MOCL methods attempt to address these challenges through various techniques, they all overlook label-specific region identification and feature learning, a fundamental solution rooted in multi-label learning but challenging to achieve in the online setting with incremental and partial supervision. To this end, we first leverage the inherent structural information of input data to evaluate and verify the innate localization capability of different pre-trained models. Then, we propose CUTER (CUT-out-and-Experience-Replay), a simple yet versatile strategy that provides fine-grained supervision signals by further identifying, strengthening, and cutting out label-specific regions for efficient experience replay. It not only enables models to simultaneously address the challenges of catastrophic forgetting, missing labels, and class imbalance, but also serves as an orthogonal solution that integrates seamlessly with existing approaches. Extensive experiments on multiple multi-label image benchmarks demonstrate the superiority of our proposed method. The code is available at https://github.com/wxr99/Cut-Replay.
