Controllable Forgetting Mechanism for Few-Shot Class-Incremental Learning
Kirill Paramonov, Mete Ozay, Eunju Yang, Jijoong Moon, Umberto Michieli
TL;DR
The paper tackles the challenge of catastrophic forgetting in one-shot Few-Shot Class-Incremental Learning (OSCIL) for on-device personalization. It introduces Novel Class Detection (NCD), a threshold-based inference rule that partitions decisions between base and novel prototypes and allows controllable forgetting by selecting a distance threshold $\alpha$ to meet a forgetting budget. Through extensive experiments across backbones and datasets, the authors show that NCD yields substantial gains in novel-class accuracy (NCR) while keeping base-class forgetting within predefined limits, with notable improvements in ultra-low-shot regimes (e.g., 1-shot, 1 novel class). The work highlights the practical impact of predictable QoS on-device, and demonstrates that the approach is compatible with various base-training strategies and can even support OOD detection as a by-product. Overall, the proposed NCD framework provides a simple, effective, and adaptable solution to balance continual adaptation and memory of base knowledge in OSCIL.
Abstract
Class-incremental learning in the context of limited personal labeled samples (few-shot) is critical for numerous real-world applications, such as smart home devices. A key challenge in these scenarios is balancing the trade-off between adapting to new, personalized classes and maintaining the performance of the model on the original, base classes. Fine-tuning the model on novel classes often leads to the phenomenon of catastrophic forgetting, where the accuracy of base classes declines unpredictably and significantly. In this paper, we propose a simple yet effective mechanism to address this challenge by controlling the trade-off between novel and base class accuracy. We specifically target the ultra-low-shot scenario, where only a single example is available per novel class. Our approach introduces a Novel Class Detection (NCD) rule, which adjusts the degree of forgetting a priori while simultaneously enhancing performance on novel classes. We demonstrate the versatility of our solution by applying it to state-of-the-art Few-Shot Class-Incremental Learning (FSCIL) methods, showing consistent improvements across different settings. To better quantify the trade-off between novel and base class performance, we introduce new metrics: NCR@2FOR and NCR@5FOR. Our approach achieves up to a 30% improvement in novel class accuracy on the CIFAR100 dataset (1-shot, 1 novel class) while maintaining a controlled base class forgetting rate of 2%.
