Annotation-Free Class-Incremental Learning
Hari Chandana Kuchibhotla, K S Ananth, Vineeth N Balasubramanian
TL;DR
The paper tackles annotation-free class-incremental learning by proposing CrossWorld-CL, which grounds learning in external world knowledge from ImageNet to guide unlabeled streams. It combines world-knowledge distillation, semantic expansion via LLMs, dual visual–semantic alignment between DINO and CLIP, and a privacy-preserving replay mechanism using ImageNet proxies. The approach employs a prompt-guided cross-domain framework with stage-wise losses, including $L_{DD}$, $L_{II}$, $L_{ID}$, and $L_{DI}$, plus KL regularization to stabilize prototypes across tasks. Across four diverse datasets, CrossWorld-CL outperforms CLIP-based, unlabeled, and conventional continual-learning baselines, with particularly strong gains on late tasks, highlighting the value of structured external knowledge for robust, annotation-free continual learning.
Abstract
Despite significant progress in continual learning ranging from architectural novelty to clever strategies for mitigating catastrophic forgetting most existing methods rest on a strong but unrealistic assumption the availability of labeled data throughout the learning process. In real-world scenarios, however, data often arrives sequentially and without annotations, rendering conventional approaches impractical. In this work, we revisit the fundamental assumptions of continual learning and ask: Can current systems adapt when labels are absent and tasks emerge incrementally over time? To this end, we introduce Annotation-Free Class-Incremental Learning (AFCIL), a more realistic and challenging paradigm where unlabeled data arrives continuously, and the learner must incrementally acquire new classes without any supervision. To enable effective learning under AFCIL, we propose CrossWorld CL, a Cross Domain World Guided Continual Learning framework that incorporates external world knowledge as a stable auxiliary source. The method retrieves semantically related ImageNet classes for each downstream category, maps downstream and ImageNet features through a cross domain alignment strategy and finally introduce a novel replay strategy. This design lets the model uncover semantic structure without annotations while keeping earlier knowledge intact. Across four datasets, CrossWorld-CL surpasses CLIP baselines and existing continual and unlabeled learning methods, underscoring the benefit of world knowledge for annotation free continual learning.
