ProtoGCD: Unified and Unbiased Prototype Learning for Generalized Category Discovery

Shijie Ma; Fei Zhu; Xu-Yao Zhang; Cheng-Lin Liu

ProtoGCD: Unified and Unbiased Prototype Learning for Generalized Category Discovery

Shijie Ma, Fei Zhu, Xu-Yao Zhang, Cheng-Lin Liu

TL;DR

ProtoGCD tackles generalized category discovery by unifying old and new classes under a shared prototypical classifier and end-to-end learning. It introduces dual-level adaptive pseudo-labeling (DAPL), entropy-based regularization, and a prototype separation term to learn unbiased, discriminative representations while avoiding confirmation bias. A practical Prototype Score criterion estimates the number of novel classes, and the framework extends to unseen outlier detection for open-world applicability. Across generic and fine-grained datasets, ProtoGCD achieves state-of-the-art performance with balanced old/new accuracy and strong OOD detection capabilities, underscoring the value of unified prototype learning for open-world clustering and classification.

Abstract

Generalized category discovery (GCD) is a pragmatic but underexplored problem, which requires models to automatically cluster and discover novel categories by leveraging the labeled samples from old classes. The challenge is that unlabeled data contain both old and new classes. Early works leveraging pseudo-labeling with parametric classifiers handle old and new classes separately, which brings about imbalanced accuracy between them. Recent methods employing contrastive learning neglect potential positives and are decoupled from the clustering objective, leading to biased representations and sub-optimal results. To address these issues, we introduce a unified and unbiased prototype learning framework, namely ProtoGCD, wherein old and new classes are modeled with joint prototypes and unified learning objectives, {enabling unified modeling between old and new classes}. Specifically, we propose a dual-level adaptive pseudo-labeling mechanism to mitigate confirmation bias, together with two regularization terms to collectively help learn more suitable representations for GCD. Moreover, for practical considerations, we devise a criterion to estimate the number of new classes. Furthermore, we extend ProtoGCD to detect unseen outliers, achieving task-level unification. Comprehensive experiments show that ProtoGCD achieves state-of-the-art performance on both generic and fine-grained datasets. The code is available at https://github.com/mashijie1028/ProtoGCD.

ProtoGCD: Unified and Unbiased Prototype Learning for Generalized Category Discovery

TL;DR

Abstract

ProtoGCD: Unified and Unbiased Prototype Learning for Generalized Category Discovery

TL;DR

Abstract

Paper Structure

Table of Contents

Key Result

Figures (16)

Theorems & Definitions (7)