CiPR: An Efficient Framework with Cross-instance Positive Relations for Generalized Category Discovery
Shaozhe Hao, Kai Han, Kwan-Yee K. Wong
TL;DR
CiPR tackles generalized category discovery in open-world settings with unknown unseen class counts by introducing cross-instance positive relations into contrastive learning. The framework hinges on SNC, a semi-supervised hierarchical clustering method that generates reliable pseudo-labels linking labelled and unlabelled data and enabling efficient one-shot class-number estimation via a joint reference score. With SNC-driven pseudo-labels, CiPR jointly optimizes two contrastive losses to learn balanced representations across seen and unseen classes, attaining state-of-the-art results on both generic and fine-grained datasets. The approach offers substantial gains in both accuracy and efficiency, notably in estimating the unknown class number with a single SNC run and without a predefined candidate set, making it practical for real-world open-world discovery.
Abstract
We tackle the issue of generalized category discovery (GCD). GCD considers the open-world problem of automatically clustering a partially labelled dataset, in which the unlabelled data may contain instances from both novel categories and labelled classes. In this paper, we address the GCD problem with an unknown category number for the unlabelled data. We propose a framework, named CiPR, to bootstrap the representation by exploiting Cross-instance Positive Relations in the partially labelled data for contrastive learning, which have been neglected in existing methods. To obtain reliable cross-instance relations to facilitate representation learning, we introduce a semi-supervised hierarchical clustering algorithm, named selective neighbor clustering (SNC), which can produce a clustering hierarchy directly from the connected components of a graph constructed from selective neighbors. We further present a method to estimate the unknown class number using SNC with a joint reference score that considers clustering indexes of both labelled and unlabelled data, and extend SNC to allow label assignment for the unlabelled instances with a given class number. We thoroughly evaluate our framework on public generic image recognition datasets and challenging fine-grained datasets, and establish a new state-of-the-art. Code: https://github.com/haoosz/CiPR
