Learn to Categorize or Categorize to Learn? Self-Coding for Generalized Category Discovery
Sarah Rastegar, Hazel Doughty, Cees G. M. Snoek
TL;DR
This work reconsiders the notion of a category by casting it as an optimization objective and develops InfoSieve, a self-supervised framework that learns binary-category codes forming an implicit hierarchical tree for generalized category discovery. The method combines information-theoretic objectives (via algorithmic and Shannon mutual information) with code-length minimization and supervision signals to extract compact, structured category representations from unlabeled data. Theoretical justifications link the learning of category codes to optimal trees under certain assumptions, and empirically InfoSieve achieves state-of-the-art results on fine-grained and open-world-like datasets, while providing interpretable hierarchical structure. By enabling test-time discovery of unknown categories with controllable granularity and without relying on fixed label sets, this approach offers a scalable pathway toward robust, human-agnostic categorization in real-world data.
Abstract
In the quest for unveiling novel categories at test time, we confront the inherent limitations of traditional supervised recognition models that are restricted by a predefined category set. While strides have been made in the realms of self-supervised and open-world learning towards test-time category discovery, a crucial yet often overlooked question persists: what exactly delineates a category? In this paper, we conceptualize a category through the lens of optimization, viewing it as an optimal solution to a well-defined problem. Harnessing this unique conceptualization, we propose a novel, efficient and self-supervised method capable of discovering previously unknown categories at test time. A salient feature of our approach is the assignment of minimum length category codes to individual data instances, which encapsulates the implicit category hierarchy prevalent in real-world datasets. This mechanism affords us enhanced control over category granularity, thereby equipping our model to handle fine-grained categories adeptly. Experimental evaluations, bolstered by state-of-the-art benchmark comparisons, testify to the efficacy of our solution in managing unknown categories at test time. Furthermore, we fortify our proposition with a theoretical foundation, providing proof of its optimality. Our code is available at https://github.com/SarahRastegar/InfoSieve.
