Few-Shot Class-Incremental Learning with Prior Knowledge
Wenhao Jiang, Duo Li, Menghan Hu, Guangtao Zhai, Xiaokang Yang, Xiao-Ping Zhang
TL;DR
The paper tackles catastrophic forgetting and overfitting in FSCIL by introducing Learning with Prior Knowledge (LwPK), which pre-trains a model with unlabeled data from prospective classes using deep clustering to produce pseudo-labels and jointly train with base data. This forward-prior strategy creates a hybrid embedding space and reduces model updates during incremental phases, as supported by empirical risk minimization and class-distance analysis. Across CIFAR100, CUB200, and miniImageNet, LwPK demonstrates superior or competitive performance relative to strong FSCIL baselines and semi-supervised rivals, highlighting the value of leveraging unlabeled data as priors. The work includes extensive ablations and theoretical justification, and releases its code to facilitate replication and future exploration of forward priors in FSCIL.
Abstract
To tackle the issues of catastrophic forgetting and overfitting in few-shot class-incremental learning (FSCIL), previous work has primarily concentrated on preserving the memory of old knowledge during the incremental phase. The role of pre-trained model in shaping the effectiveness of incremental learning is frequently underestimated in these studies. Therefore, to enhance the generalization ability of the pre-trained model, we propose Learning with Prior Knowledge (LwPK) by introducing nearly free prior knowledge from a few unlabeled data of subsequent incremental classes. We cluster unlabeled incremental class samples to produce pseudo-labels, then jointly train these with labeled base class samples, effectively allocating embedding space for both old and new class data. Experimental results indicate that LwPK effectively enhances the model resilience against catastrophic forgetting, with theoretical analysis based on empirical risk minimization and class distance measurement corroborating its operational principles. The source code of LwPK is publicly available at: \url{https://github.com/StevenJ308/LwPK}.
