Table of Contents
Fetching ...

Composing Novel Classes: A Concept-Driven Approach to Generalized Category Discovery

Chuyu Zhang, Peiyan Gu, Xueyang Yu, Xuming He

TL;DR

This work tackles generalized category discovery by introducing ConceptGCD, a three-stage, concept-driven framework that first learns known-class concepts with a covariance-promoting loss, then generates derivable concepts from these concepts via a generator layer, and finally learns underivable concepts with an expansion layer, a contrastive objective, and a concept score normalization to balance concept influence. The approach explicitly separates derivable and underivable concept learning to reduce noise from unlabeled data and to better leverage known-class knowledge in a richer representation space. Empirical results across six benchmarks with both ViT-based backbones show substantial gains over state-of-the-art methods, including notable improvements on novel class clustering and mixed known/novel settings, and maintain robustness when the number of novel clusters is unknown. The proposed covariance-driven concept diversification, explicit concept transfer, and normalization mechanism offer a practical baseline for future GCD research and open-world recognition, highlighting the value of concept-level transfer over joint encoder sharing.

Abstract

We tackle the generalized category discovery (GCD) problem, which aims to discover novel classes in unlabeled datasets by leveraging the knowledge of known classes. Previous works utilize the known class knowledge through shared representation spaces. Despite their progress, our analysis experiments show that novel classes can achieve impressive clustering results on the feature space of a known class pre-trained model, suggesting that existing methods may not fully utilize known class knowledge. To address it, we introduce a novel concept learning framework for GCD, named ConceptGCD, that categorizes concepts into two types: derivable and underivable from known class concepts, and adopts a stage-wise learning strategy to learn them separately. Specifically, our framework first extracts known class concepts by a known class pre-trained model and then produces derivable concepts from them by a generator layer with a covariance-augmented loss. Subsequently, we expand the generator layer to learn underivable concepts in a balanced manner ensured by a concept score normalization strategy and integrate a contrastive loss to preserve previously learned concepts. Extensive experiments on various benchmark datasets demonstrate the superiority of our approach over the previous state-of-the-art methods. Code will be available soon.

Composing Novel Classes: A Concept-Driven Approach to Generalized Category Discovery

TL;DR

This work tackles generalized category discovery by introducing ConceptGCD, a three-stage, concept-driven framework that first learns known-class concepts with a covariance-promoting loss, then generates derivable concepts from these concepts via a generator layer, and finally learns underivable concepts with an expansion layer, a contrastive objective, and a concept score normalization to balance concept influence. The approach explicitly separates derivable and underivable concept learning to reduce noise from unlabeled data and to better leverage known-class knowledge in a richer representation space. Empirical results across six benchmarks with both ViT-based backbones show substantial gains over state-of-the-art methods, including notable improvements on novel class clustering and mixed known/novel settings, and maintain robustness when the number of novel clusters is unknown. The proposed covariance-driven concept diversification, explicit concept transfer, and normalization mechanism offer a practical baseline for future GCD research and open-world recognition, highlighting the value of concept-level transfer over joint encoder sharing.

Abstract

We tackle the generalized category discovery (GCD) problem, which aims to discover novel classes in unlabeled datasets by leveraging the knowledge of known classes. Previous works utilize the known class knowledge through shared representation spaces. Despite their progress, our analysis experiments show that novel classes can achieve impressive clustering results on the feature space of a known class pre-trained model, suggesting that existing methods may not fully utilize known class knowledge. To address it, we introduce a novel concept learning framework for GCD, named ConceptGCD, that categorizes concepts into two types: derivable and underivable from known class concepts, and adopts a stage-wise learning strategy to learn them separately. Specifically, our framework first extracts known class concepts by a known class pre-trained model and then produces derivable concepts from them by a generator layer with a covariance-augmented loss. Subsequently, we expand the generator layer to learn underivable concepts in a balanced manner ensured by a concept score normalization strategy and integrate a contrastive loss to preserve previously learned concepts. Extensive experiments on various benchmark datasets demonstrate the superiority of our approach over the previous state-of-the-art methods. Code will be available soon.

Paper Structure

This paper contains 43 sections, 9 equations, 9 figures, 18 tables.

Figures (9)

  • Figure 1: Generate derivable concepts. The linear layer and classifier are trained on novel and known class data with $\mathcal{L}_{base}$wen2022simgcd defined in Eq. \ref{['eq:base_loss']}.
  • Figure 2: The overview of our novel ConceptGCD learning framework. Car, Bird, and Cat, Tree represent known and novel classes, respectively. 'Cls' denotes the classifier, and the circle in each image (left) represents the concepts present in the corresponding data. Our framework consists of three training stages. In the first stage (top left), we fine-tune an encoder using labeled known class data to learn known class concepts. In the second stage(top right), we train a generator layer (GL) that can derive concepts from known class concepts. The final stage (bottom) introduces an expansion layer (EL), which builds upon the GL by increasing its dimensionality and incorporating a concept score normalization technique. Both the encoder and the EL are concurrently trained to learn novel concepts while preserving previously learned concepts, guided by the loss function $\mathcal{L}_{smi}$. The concept shown above is for understanding, and the learned concepts are visualized in Appendix \ref{['concept_vis']}.
  • Figure 3: t-SNE visualization on CIFAR100-80. More visualization are in Appendix \ref{['mtsne_vis']}.
  • Figure 4: Concept Visualization of Known Class Pre-Trained Model, Generator Layer, and Final Model on CIFAR100-80 and CUB. Attention maps for selected concepts are generated using Grad-CAMselvaraju2017grad, with model scores provided for each concept. Additionally, $\star$ denotes the highest score among all concepts, while $\times$ on the attention map indicates the absence of the model response to that concept. This behavior is exclusive to models utilizing ReLU activation (Generator Layer and Final Model). The blanks in the figure are caused by the fact that the number of concepts learned by the known class pre-trained Model, generator Layer, and final model are different.
  • Figure 5: Concept Visualization of Known Class Pre-Trained Model, Generator Layer, and Final Model on Stanford Cars.
  • ...and 4 more figures