Rethinking Few-shot Class-incremental Learning: Learning from Yourself
Yu-Ming Tang, Yi-Xing Peng, Jingke Meng, Wei-Shi Zheng
TL;DR
This work addresses the evaluation bias in FSCIL by introducing generalized average accuracy ($gAcc$), a parameterized metric that balances base and novel-class performance and is summarized via the area under its curve (AUC) across $\alpha$. It also proposes a ViT-based framework with a lightweight Feature Rectification (FR) module that leverages intermediate-layer representations through two relation-transfer losses (IR and CR) and multi-layer knowledge ensembles to improve novel-class generalization. The approach yields strong results across three FSCIL benchmarks (CIFAR-100, miniImageNet, CUB-200), with notable gains in $gAcc$ while maintaining competitive $aAcc$, and is supported by extensive ablations and corner-case analyses. The work provides a practical framework and evaluation toolkit for balancing base-novel performance in continual learning, with publicly available code to foster reproducibility and comparability.
Abstract
Few-shot class-incremental learning (FSCIL) aims to learn sequential classes with limited samples in a few-shot fashion. Inherited from the classical class-incremental learning setting, the popular benchmark of FSCIL uses averaged accuracy (aAcc) and last-task averaged accuracy (lAcc) as the evaluation metrics. However, we reveal that such evaluation metrics may not provide adequate emphasis on the novel class performance, and the continual learning ability of FSCIL methods could be ignored under this benchmark. In this work, as a complement to existing metrics, we offer a new metric called generalized average accuracy (gAcc) which is designed to provide an extra equitable evaluation by incorporating different perspectives of the performance under the guidance of a parameter $α$. We also present an overall metric in the form of the area under the curve (AUC) along the $α$. Under the guidance of gAcc, we release the potential of intermediate features of the vision transformers to boost the novel-class performance. Taking information from intermediate layers which are less class-specific and more generalizable, we manage to rectify the final features, leading to a more generalizable transformer-based FSCIL framework. Without complex network designs or cumbersome training procedures, our method outperforms existing FSCIL methods at aAcc and gAcc on three datasets. See codes at https://github.com/iSEE-Laboratory/Revisting_FSCIL
