Learning Part Knowledge to Facilitate Category Understanding for Fine-Grained Generalized Category Discovery
Enguang Wang, Zhimao Peng, Zhengyuan Xie, Haori Lu, Fei Yang, Xialei Liu
TL;DR
PartGCD addresses the brittleness of fine-grained generalized category discovery by integrating explicit part knowledge into learning. It introduces Adaptive Part Decomposition to automatically extract class-specific parts and Part Discrepancy Regularization to encourage distinct part representations, combined with a calibrated prototype and GMM-based part modeling. The approach yields state-of-the-art results on multiple fine-grained benchmarks while remaining competitive on generic datasets, validating the importance of local, part-level cues for robust category understanding in unlabeled data. The work also provides theoretical grounding for Sinkhorn-based prototype calibration and comprehensive analyses, highlighting practical benefits in real-world-like, fine-grained discovery tasks.
Abstract
Generalized Category Discovery (GCD) aims to classify unlabeled data containing both seen and novel categories. Although existing methods perform well on generic datasets, they struggle in fine-grained scenarios. We attribute this difficulty to their reliance on contrastive learning over global image features to automatically capture discriminative cues, which fails to capture the subtle local differences essential for distinguishing fine-grained categories. Therefore, in this paper, we propose incorporating part knowledge to address fine-grained GCD, which introduces two key challenges: the absence of annotations for novel classes complicates the extraction of the part features, and global contrastive learning prioritizes holistic feature invariance, inadvertently suppressing discriminative local part patterns. To address these challenges, we propose PartGCD, including 1) Adaptive Part Decomposition, which automatically extracts class-specific semantic parts via Gaussian Mixture Models, and 2) Part Discrepancy Regularization, enforcing explicit separation between part features to amplify fine-grained local part distinctions. Experiments demonstrate state-of-the-art performance across multiple fine-grained benchmarks while maintaining competitiveness on generic datasets, validating the effectiveness and robustness of our approach.
