Consistent Supervised-Unsupervised Alignment for Generalized Category Discovery
Jizhou Han, Shaokun Wang, Yuhang He, Chenhao Ding, Qiang Wang, Xinyuan Gao, SongLin Dong, Yihong Gong
TL;DR
NC-GCD addresses inconsistent objectives and category confusion in Generalized Category Discovery by fixing a Simplex ETF prototype space and enforcing consistent alignment between supervised labels and unsupervised clusters. A Consistent ETF Alignment Loss combines supervised and unsupervised ETF alignment, while the Semantic Consistency Matcher stabilizes cluster-label mappings across iterations, yielding a Geometry-driven, NC-inspired feature space for both known and novel categories. Across six benchmarks and multiple backbones, NC-GCD achieves state-of-the-art performance, especially in novel-category accuracy, and demonstrates robustness to K estimation and clustering dynamics. The work provides a principled, efficient framework for open-world category learning with strong practical impact for scalable recognition and discovery.
Abstract
Generalized Category Discovery (GCD) focuses on classifying known categories while simultaneously discovering novel categories from unlabeled data. However, previous GCD methods face challenges due to inconsistent optimization objectives and category confusion. This leads to feature overlap and ultimately hinders performance on novel categories. To address these issues, we propose the Neural Collapse-inspired Generalized Category Discovery (NC-GCD) framework. By pre-assigning and fixing Equiangular Tight Frame (ETF) prototypes, our method ensures an optimal geometric structure and a consistent optimization objective for both known and novel categories. We introduce a Consistent ETF Alignment Loss that unifies supervised and unsupervised ETF alignment and enhances category separability. Additionally, a Semantic Consistency Matcher (SCM) is designed to maintain stable and consistent label assignments across clustering iterations. Our method achieves strong performance on multiple GCD benchmarks, significantly enhancing novel category accuracy and demonstrating its effectiveness.
