CALA: A Class-Aware Logit Adapter for Few-Shot Class-Incremental Learning
Chengyan Liu, Linglan Zhao, Fan Lyu, Kaile Du, Fuyuan Hu, Tao Zhou
TL;DR
This work tackles Few-Shot Class-Incremental Learning (FSCIL) where a backbone trained on base data is frozen, causing base-class bias and novel-class confusion. It introduces Class-Aware Logit Adapter (CALA), a lightweight classifier-level module learned through pseudo-incremental training that outputs per-class balancing factors $\boldsymbol{\beta}$ from base–novel similarities and applies them to final logits via $\hat{\mathbf{z}}=\mathbf{z}+[\mathbf{0}, \gamma\boldsymbol{\beta}]$, enabling dynamic, class-aware calibration. The balancing factors are produced by an MLP $g_\phi(\cdot)$ fed with a cosine-based similarity matrix $\tilde{\mathbf{S}}_c$ between fake novel prototypes and base prototypes, and CALA is designed as a plug-and-play module that preserves the feature space. Empirical results on mini-ImageNet, CIFAR-100, and CUB-200 show CALA consistently outperforms SOTAs, reduces novel-to-base misclassification, and generalizes across different FSCIL baselines, demonstrating practical impact in reducing novel-class confusion without retraining the backbone.
Abstract
Few-Shot Class-Incremental Learning (FSCIL) defines a practical but challenging task where models are required to continuously learn novel concepts with only a few training samples. Due to data scarcity, existing FSCIL methods resort to training a backbone with abundant base data and then keeping it frozen afterward. However, the above operation often causes the backbone to overfit to base classes while overlooking the novel ones, leading to severe confusion between them. To address this issue, we propose Class-Aware Logit Adapter (CALA). Our method involves a lightweight adapter that learns to rectify biased predictions through a pseudo-incremental learning paradigm. In the real FSCIL process, we use the learned adapter to dynamically generate robust balancing factors. These factors can adjust confused novel instances back to their true label space based on their similarity to base classes. Specifically, when confusion is more likely to occur in novel instances that closely resemble base classes, greater rectification is required. Notably, CALA operates on the classifier level, preserving the original feature space, thus it can be flexibly plugged into most of the existing FSCIL works for improved performance. Experiments on three benchmark datasets consistently validate the effectiveness and flexibility of CALA. Codes will be available upon acceptance.
