Category Adaptation Meets Projected Distillation in Generalized Continual Category Discovery
Grzegorz Rypeść, Daniel Marczak, Sebastian Cygert, Tomasz Trzciński, Bartłomiej Twardowski
TL;DR
This work addresses Generalized Continual Category Discovery (GCCD), a setting requiring learning from sequential, partially labeled data while discovering novel categories. It introduces CAMP, a method that pairs projected distillation via a learnable projector with a centroid-drift predictor (an auxiliary category adaptation network) to model and compensate for past-class drift, enabling robust plasticity and stability without exemplars. CAMP demonstrates state-of-the-art performance across GCCD and exemplar-free Class Incremental Learning on multiple datasets, with analyses showing the benefits of combining projection-based distillation and centroid adaptation and clarifying the roles of adapters and distillers. The approach offers a practical, scalable solution for continual learning with partially labeled data and evolving category distributions, albeit with limitations for non-centroid-based representations and exemplar-free scalability in certain settings.
Abstract
Generalized Continual Category Discovery (GCCD) tackles learning from sequentially arriving, partially labeled datasets while uncovering new categories. Traditional methods depend on feature distillation to prevent forgetting the old knowledge. However, this strategy restricts the model's ability to adapt and effectively distinguish new categories. To address this, we introduce a novel technique integrating a learnable projector with feature distillation, thus enhancing model adaptability without sacrificing past knowledge. The resulting distribution shift of the previously learned categories is mitigated with the auxiliary category adaptation network. We demonstrate that while each component offers modest benefits individually, their combination - dubbed CAMP (Category Adaptation Meets Projected distillation) - significantly improves the balance between learning new information and retaining old. CAMP exhibits superior performance across several GCCD and Class Incremental Learning scenarios. The code is available at https://github.com/grypesc/CAMP.
