Proxy-Anchor and EVT-Driven Continual Learning Method for Generalized Category Discovery
Alireza Fathalizadeh, Roozbeh Razavi-Far
TL;DR
This work tackles Continual Generalized Category Discovery (CGCD) by introducing EVT-guided boundaries around proxy anchors and a dedicated EVT-based loss to simultaneously reject unknowns and improve representation. The method, named CATEGORIZER, comprises an Initial Stage that leverages Proxy Anchor loss followed by Weibull-based EVT analysis to define a probabilistic inclusion boundary and a subsequent $\,\ell_{evt}$ loss, and a Continual Learning Stage that performs novelty detection, clustering of unknowns into novel classes, and incremental proxy updates with memory and distillation while applying model reduction to discard redundant proxies. Key contributions include the EVT-based loss for representation learning, a threshold-based novelty detection and clustering pipeline, and a greedy set-cover proxy reduction to mitigate overestimation of novel categories. Experiments on multiple fine-grained datasets demonstrate superior performance over SOTA CGCD methods, validating the approach's effectiveness in mitigating forgetting while enabling robust discovery. The work has practical impact for real-world systems requiring open-set recognition and continual adaptation without extensive labeled data. Future work includes integrating the EVT loss into the continual stage and exploring alternative clustering strategies.
Abstract
Continual generalized category discovery has been introduced and studied in the literature as a method that aims to continuously discover and learn novel categories in incoming data batches while avoiding catastrophic forgetting of previously learned categories. A key component in addressing this challenge is the model's ability to separate novel samples, where Extreme Value Theory (EVT) has been effectively employed. In this work, we propose a novel method that integrates EVT with proxy anchors to define boundaries around proxies using a probability of inclusion function, enabling the rejection of unknown samples. Additionally, we introduce a novel EVT-based loss function to enhance the learned representation, achieving superior performance compared to other deep-metric learning methods in similar settings. Using the derived probability functions, novel samples are effectively separated from previously known categories. However, category discovery within these novel samples can sometimes overestimate the number of new categories. To mitigate this issue, we propose a novel EVT-based approach to reduce the model size and discard redundant proxies. We also incorporate experience replay and knowledge distillation mechanisms during the continual learning stage to prevent catastrophic forgetting. Experimental results demonstrate that our proposed approach outperforms state-of-the-art methods in continual generalized category discovery scenarios.
