Consistent Supervised-Unsupervised Alignment for Generalized Category Discovery

Jizhou Han; Shaokun Wang; Yuhang He; Chenhao Ding; Qiang Wang; Xinyuan Gao; SongLin Dong; Yihong Gong

Consistent Supervised-Unsupervised Alignment for Generalized Category Discovery

Jizhou Han, Shaokun Wang, Yuhang He, Chenhao Ding, Qiang Wang, Xinyuan Gao, SongLin Dong, Yihong Gong

TL;DR

NC-GCD addresses inconsistent objectives and category confusion in Generalized Category Discovery by fixing a Simplex ETF prototype space and enforcing consistent alignment between supervised labels and unsupervised clusters. A Consistent ETF Alignment Loss combines supervised and unsupervised ETF alignment, while the Semantic Consistency Matcher stabilizes cluster-label mappings across iterations, yielding a Geometry-driven, NC-inspired feature space for both known and novel categories. Across six benchmarks and multiple backbones, NC-GCD achieves state-of-the-art performance, especially in novel-category accuracy, and demonstrates robustness to K estimation and clustering dynamics. The work provides a principled, efficient framework for open-world category learning with strong practical impact for scalable recognition and discovery.

Abstract

Generalized Category Discovery (GCD) focuses on classifying known categories while simultaneously discovering novel categories from unlabeled data. However, previous GCD methods face challenges due to inconsistent optimization objectives and category confusion. This leads to feature overlap and ultimately hinders performance on novel categories. To address these issues, we propose the Neural Collapse-inspired Generalized Category Discovery (NC-GCD) framework. By pre-assigning and fixing Equiangular Tight Frame (ETF) prototypes, our method ensures an optimal geometric structure and a consistent optimization objective for both known and novel categories. We introduce a Consistent ETF Alignment Loss that unifies supervised and unsupervised ETF alignment and enhances category separability. Additionally, a Semantic Consistency Matcher (SCM) is designed to maintain stable and consistent label assignments across clustering iterations. Our method achieves strong performance on multiple GCD benchmarks, significantly enhancing novel category accuracy and demonstrating its effectiveness.

Consistent Supervised-Unsupervised Alignment for Generalized Category Discovery

TL;DR

Abstract

Consistent Supervised-Unsupervised Alignment for Generalized Category Discovery

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (12)