A Bag of Tricks for Few-Shot Class-Incremental Learning
Shuvendu Roy, Chunjong Park, Aldi Fahrezi, Ali Etemad
TL;DR
This work addresses FSCIL, a challenging continual learning setting with limited samples for new classes, by proposing a bag-of-tricks framework that unifies six techniques across stability, adaptability, and training. The approach combines SupCon-based stability, ETF prototype pre-assignment, pseudo-classes, Incremental SubNet Tuning, self-supervised pre-training, and a rotation pretext task within a baseline incremental-frozen structure. Empirically, it achieves state-of-the-art results on CIFAR-100, CUB-200, and miniImageNet, with notable improvements in stability (reduced forgetting) and adaptability (better novel-class learning), and demonstrates scalability to larger encoders and ImageNet-1K. The work provides a practical baseline and a comprehensive analysis of how these tricks interact to balance the stability–adaptability trade-off in FSCIL, with broader implications for data-scarce continual learning scenarios.
Abstract
We present a bag of tricks framework for few-shot class-incremental learning (FSCIL), which is a challenging form of continual learning that involves continuous adaptation to new tasks with limited samples. FSCIL requires both stability and adaptability, i.e., preserving proficiency in previously learned tasks while learning new ones. Our proposed bag of tricks brings together six key and highly influential techniques that improve stability, adaptability, and overall performance under a unified framework for FSCIL. We organize these tricks into three categories: stability tricks, adaptability tricks, and training tricks. Stability tricks aim to mitigate the forgetting of previously learned classes by enhancing the separation between the embeddings of learned classes and minimizing interference when learning new ones. On the other hand, adaptability tricks focus on the effective learning of new classes. Finally, training tricks improve the overall performance without compromising stability or adaptability. We perform extensive experiments on three benchmark datasets, CIFAR-100, CUB-200, and miniIMageNet, to evaluate the impact of our proposed framework. Our detailed analysis shows that our approach substantially improves both stability and adaptability, establishing a new state-of-the-art by outperforming prior works in the area. We believe our method provides a go-to solution and establishes a robust baseline for future research in this area.
