Few-Shot Class Incremental Learning with Attention-Aware Self-Adaptive Prompt
Chenxi Liu, Zhenyi Wang, Tianyi Xiong, Ruibo Chen, Yihan Wu, Junfeng Guo, Heng Huang
TL;DR
This work tackles Few-Shot Class-Incremental Learning by freezing a pretrained Vision Transformer backbone and introducing a prompt-based framework. It splits prompts into attention-aware task-invariant prompts (TIP) and self-adaptive task-specific prompts (TSP), with TIP enforcing consistent attention to reduce task-specific information and TSP generated through a prompt encoder guided by an Information Bottleneck objective. An EMA based p_avg and an anchor loss further promote generalization and discriminability, enabling effective knowledge transfer from base to new classes without rehearsal buffers. Empirical results on CIFAR100, CUB200-2011, and ImageNet-R demonstrate that ASP consistently outperforms state-of-the-art FSCIL and prompt-based CIL methods in learning new classes while halving forgetting on base classes. The approach provides a data-efficient, scalable solution for continual learning in vision tasks using fixed backbones and learned prompts.
Abstract
Few-Shot Class-Incremental Learning (FSCIL) models aim to incrementally learn new classes with scarce samples while preserving knowledge of old ones. Existing FSCIL methods usually fine-tune the entire backbone, leading to overfitting and hindering the potential to learn new classes. On the other hand, recent prompt-based CIL approaches alleviate forgetting by training prompts with sufficient data in each task. In this work, we propose a novel framework named Attention-aware Self-adaptive Prompt (ASP). ASP encourages task-invariant prompts to capture shared knowledge by reducing specific information from the attention aspect. Additionally, self-adaptive task-specific prompts in ASP provide specific information and transfer knowledge from old classes to new classes with an Information Bottleneck learning objective. In summary, ASP prevents overfitting on base task and does not require enormous data in few-shot incremental tasks. Extensive experiments on three benchmark datasets validate that ASP consistently outperforms state-of-the-art FSCIL and prompt-based CIL methods in terms of both learning new classes and mitigating forgetting.
