TEAL: New Selection Strategy for Small Buffers in Experience Replay Class Incremental Learning
Shahar Shaul-Ariel, Daphna Weinshall
TL;DR
The paper tackles catastrophic forgetting in class-incremental learning under tight replay-memory budgets by introducing TEAL, a typicality-based exemplar selection method that also ensures diversity via clustering in a learned embedding space. TEAL selects representative exemplars per class through an iterative, pace-driven process using K-Means to cover uncovered regions and a priority-removal scheme to preserve the most informative samples; it is designed as a modular add-on for existing ER-IL approaches. Across Split CIFAR-100, Split tinyImageNet, and Split CUB-200, TEAL yields state-of-the-art or near-state-of-the-art final accuracies, especially when the buffer is very small (1–3 exemplars per class), and remains beneficial across multiple architectures and task settings. The work provides practical impact for memory-constrained continual learning systems, and the authors provide code for reproducibility at their GitHub repository.
Abstract
Continual Learning is an unresolved challenge, whose relevance increases when considering modern applications. Unlike the human brain, trained deep neural networks suffer from a phenomenon called catastrophic forgetting, wherein they progressively lose previously acquired knowledge upon learning new tasks. To mitigate this problem, numerous methods have been developed, many relying on the replay of past exemplars during new task training. However, as the memory allocated for replay decreases, the effectiveness of these approaches diminishes. On the other hand, maintaining a large memory for the purpose of replay is inefficient and often impractical. Here we introduce TEAL, a novel approach to populate the memory with exemplars, that can be integrated with various experience-replay methods and significantly enhance their performance with small memory buffers. We show that TEAL enhances the average accuracy of existing class-incremental methods and outperforms other selection strategies, achieving state-of-the-art performance even with small memory buffers of 1-3 exemplars per class in the final task. This confirms our initial hypothesis that when memory is scarce, it is best to prioritize the most typical data. Code is available at this https URL: https://github.com/shahariel/TEAL.
