Cross-Class Feature Augmentation for Class Incremental Learning
Taehoon Kim, Jaeyoo Park, Bohyung Han
TL;DR
This paper tackles catastrophic forgetting in class incremental learning by introducing Cross-Class Feature Augmentation (CCFA), a feature-space augmentation technique inspired by adversarial perturbations that exploits a previously learned classifier to augment old-task representations without changing architecture. CCFA perturbs current-task features toward target old classes, uses the old classifier to generate pseudo-labels, and trains on both original and augmented features within existing knowledge-distillation frameworks. The method yields consistent improvements on CIFAR-100 and ImageNet across multiple baselines, especially under extreme memory constraints, and is demonstrated to be compatible with other augmentations while adding only modest computational overhead. Overall, CCFA provides a practical, architecture-agnostic approach to stabilizing decision boundaries for old tasks in continual learning with limited exemplars, enhancing both stability and generalization.
Abstract
We propose a novel class incremental learning approach by incorporating a feature augmentation technique motivated by adversarial attacks. We employ a classifier learned in the past to complement training examples rather than simply play a role as a teacher for knowledge distillation towards subsequent models. The proposed approach has a unique perspective to utilize the previous knowledge in class incremental learning since it augments features of arbitrary target classes using examples in other classes via adversarial attacks on a previously learned classifier. By allowing the cross-class feature augmentations, each class in the old tasks conveniently populates samples in the feature space, which alleviates the collapse of the decision boundaries caused by sample deficiency for the previous tasks, especially when the number of stored exemplars is small. This idea can be easily incorporated into existing class incremental learning algorithms without any architecture modification. Extensive experiments on the standard benchmarks show that our method consistently outperforms existing class incremental learning methods by significant margins in various scenarios, especially under an environment with an extremely limited memory budget.
