From Categories to Classifiers: Name-Only Continual Learning by Exploring the Web
Ameya Prabhu, Hasan Abed Al Kader Hammoud, Ser-Nam Lim, Bernard Ghanem, Philip H. S. Torr, Adel Bibi
TL;DR
The paper tackles the high cost of data labeling in continual learning by proposing name-only continual learning (NO-CL), which relies on uncurated webly-supervised data collected via simple queries (Category Name + Auxiliary Suffix). The core method, C2C, uses a fixed backbone to extract features from web-sourced data and trains a lightweight classifier under constrained computational budgets, enabling rapid, scalable adaptation across timesteps. Empirical results show web data can match or surpass manually annotated training in fine-grained tasks, with 2–25% absolute gains over prior name-only methods and strong performance in class-, domain-, and time-incremental settings; EvoTrends demonstrates real-world applicability by tracking evolving trends over 21 years. The work highlights substantial reductions in annotation time and cost (minutes, <$15 on AWS) while maintaining competitive accuracy, and introduces EvoTrends as a practical continual-name-only benchmark for evaluating adaptation to real-world trends. Overall, the approach reveals the viability of webly-supervised data as a robust substitute for manual labeling in continual learning and points to further extensions such as test-time adaptation and online continual learning.
Abstract
Continual Learning (CL) often relies on the availability of extensive annotated datasets, an assumption that is unrealistically time-consuming and costly in practice. We explore a novel paradigm termed name-only continual learning where time and cost constraints prohibit manual annotation. In this scenario, learners adapt to new category shifts using only category names without the luxury of annotated training data. Our proposed solution leverages the expansive and ever-evolving internet to query and download uncurated webly-supervised data for image classification. We investigate the reliability of our web data and find them comparable, and in some cases superior, to manually annotated datasets. Additionally, we show that by harnessing the web, we can create support sets that surpass state-of-the-art name-only classification that create support sets using generative models or image retrieval from LAION-5B, achieving up to 25% boost in accuracy. When applied across varied continual learning contexts, our method consistently exhibits a small performance gap in comparison to models trained on manually annotated datasets. We present EvoTrends, a class-incremental dataset made from the web to capture real-world trends, created in just minutes. Overall, this paper underscores the potential of using uncurated webly-supervised data to mitigate the challenges associated with manual data labeling in continual learning.
