Singular Value Fine-tuning for Few-Shot Class-Incremental Learning
Zhiwu Wang, Yichen Wu, Renzhen Wang, Haokun Lin, Quanziang Wang, Qian Zhao, Deyu Meng
TL;DR
FSCIL combines continual learning with extreme data scarcity for new classes, making overfitting a critical risk when using large foundation models. The paper introduces SVFCL, a singular-value-based fine-tuning method that freezes the SVD bases $\mathbf{U}$ and $\mathbf{V}$ of pretrained weights and only learns to adjust and merge the singular values across incremental tasks, yielding updates $\Delta W = \mathbf{U} \mathcal{M}(\{\Delta\Sigma_i\}_{i=0}^{t-1}, \Delta\Sigma_t) \mathbf{V}^\top$. Theoretical and empirical analysis shows SVFCL uses far fewer trainable parameters and concentrates updates along principal components, reducing overfitting while maintaining strong knowledge retention, outperforming prompt-tuning and LoRA-based baselines on miniImageNet, ImageNet-R, and CUB200-2011. Ablation studies confirm the importance of freezing $\mathbf{U}$ and $\mathbf{V}$, the choice of blocks to fine-tune in ViT, and the efficacy of low-rank SVD approximations. Overall, SVFCL provides a simple, effective, and scalable approach for FSCIL with foundation models, achieving state-of-the-art results and robust generalization to distribution shifts.
Abstract
Class-Incremental Learning (CIL) aims to prevent catastrophic forgetting of previously learned classes while sequentially incorporating new ones. The more challenging Few-shot CIL (FSCIL) setting further complicates this by providing only a limited number of samples for each new class, increasing the risk of overfitting in addition to standard CIL challenges. While catastrophic forgetting has been extensively studied, overfitting in FSCIL, especially with large foundation models, has received less attention. To fill this gap, we propose the Singular Value Fine-tuning for FSCIL (SVFCL) and compared it with existing approaches for adapting foundation models to FSCIL, which primarily build on Parameter Efficient Fine-Tuning (PEFT) methods like prompt tuning and Low-Rank Adaptation (LoRA). Specifically, SVFCL applies singular value decomposition to the foundation model weights, keeping the singular vectors fixed while fine-tuning the singular values for each task, and then merging them. This simple yet effective approach not only alleviates the forgetting problem but also mitigates overfitting more effectively while significantly reducing trainable parameters. Extensive experiments on four benchmark datasets, along with visualizations and ablation studies, validate the effectiveness of SVFCL. The code will be made available.
