Neighborhood Commonality-aware Evolution Network for Continuous Generalized Category Discovery
Ye Wang, Yaxiong Wang, Guoshuai Zhao, Xueming Qian
TL;DR
NCENet tackles Continuous Generalized Category Discovery (C-GCD) by decomposing the problem into learning discriminative representations for novel classes (NCRL) and preserving old-class knowledge (BCKD) in an incremental, unlabelled-data setting. NCRL leverages local neighborhood commonalities via μ_i = (1/k) sum_{q in NN(z_i)} z_q to form meaningful prediction distributions p over μ, guided by a self-distillation objective with L_ncrl = -1/|B| sum_i sum_j p_i^j log \, p_hat_i^j, enabling robust novel-class representation without requiring explicit labels. BCKD introduces a bi-level contrastive distillation framework with student-anchored and teacher-anchored losses L_sa and L_ta, and sets L_bckd = (L_sa + L_ta)/2 to balance plasticity and stability during incremental learning. Experiments on CIFAR10, CIFAR100 and Tiny-ImageNet show NCENet achieving state-of-the-art clustering accuracy, especially in the last incremental sessions, validating the effectiveness of neighborhood-based representation learning and contrastive knowledge transfer for continual generalized category discovery.
Abstract
Continuous Generalized Category Discovery (C-GCD) aims to continually discover novel classes from unlabelled image sets while maintaining performance on old classes. In this paper, we propose a novel learning framework, dubbed Neighborhood Commonality-aware Evolution Network (NCENet) that conquers this task from the perspective of representation learning. Concretely, to learn discriminative representations for novel classes, a Neighborhood Commonality-aware Representation Learning (NCRL) is designed, which exploits local commonalities derived neighborhoods to guide the learning of representational differences between instances of different classes. To maintain the representation ability for old classes, a Bi-level Contrastive Knowledge Distillation (BCKD) module is designed, which leverages contrastive learning to perceive the learning and learned knowledge and conducts knowledge distillation. Extensive experiments conducted on CIFAR10, CIFAR100, and Tiny-ImageNet demonstrate the superior performance of NCENet compared to the previous state-of-the-art method. Particularly, in the last incremental learning session on CIFAR100, the clustering accuracy of NCENet outperforms the second-best method by a margin of 3.09\% on old classes and by a margin of 6.32\% on new classes. Our code will be publicly available at \href{https://github.com/xjtuYW/NCENet.git}{https://github.com/xjtuYW/NCENet.git}. \end{abstract}
