Continual Learning on a Diet: Learning from Sparsely Labeled Streams Under Constrained Computation
Wenxuan Zhang, Youssef Mohamed, Bernard Ghanem, Philip H. S. Torr, Adel Bibi, Mohamed Elhoseiny
TL;DR
This work introduces Continual Learning on a Diet, a budgeted semi-supervised CL setting with per-step compute constraints and sparse labels, and proposes DietCL as a simple, effective baseline that jointly learns from labeled and unlabeled data. DietCL allocates computation across unlabeled data, current labeled data, and a balanced buffer of past labeled data, using an MAE-based SSL term and a masked classification loss to prevent forgetting. Across large-scale benchmarks ImageNet10k, CLOC, and CGLM, DietCL outperforms both supervised CL and recent semi-supervised CL methods under the same budget, with gains of a few percentage points and robust behavior across varying budgets, label rates, and stream lengths. The results highlight the practical value of leveraging unlabeled data under tight compute budgets to improve generalization and stability in real-world continual learning scenarios.
Abstract
We propose and study a realistic Continual Learning (CL) setting where learning algorithms are granted a restricted computational budget per time step while training. We apply this setting to large-scale semi-supervised Continual Learning scenarios with sparse label rates. Previous proficient CL methods perform very poorly in this challenging setting. Overfitting to the sparse labeled data and insufficient computational budget are the two main culprits for such a poor performance. Our new setting encourages learning methods to effectively and efficiently utilize the unlabeled data during training. To that end, we propose a simple but highly effective baseline, DietCL, which utilizes both unlabeled and labeled data jointly. DietCL meticulously allocates computational budget for both types of data. We validate our baseline, at scale, on several datasets, e.g., CLOC, ImageNet10K, and CGLM, under constraint budget setups. DietCL outperforms, by a large margin, all existing supervised CL algorithms as well as more recent continual semi-supervised methods. Our extensive analysis and ablations demonstrate that DietCL is stable under a full spectrum of label sparsity, computational budget, and various other ablations.
