Cross-domain Open-world Discovery
Shuo Wen, Maria Brbic
TL;DR
The paper addresses the problem of discovering unseen classes under domain shift while labeling seen classes, proposing a Cross-domain Open-world Discovery (CD-OWD) setting. It introduces CROW, a cluster-then-match prototype-based method that leverages the structured latent space of foundation models to cluster target data into target prototypes, match them to seen prototypes with a robust many-to-many mapping, and fine-tune the representation via a combined cross-entropy and entropy-regularization objective. Through extensive experiments on Office, OfficeHome, VisDA, and DomainNet across 75 settings, CROW consistently outperforms open-world SSL and universal domain adaptation baselines, achieving about an 8% improvement in the H-score on average and demonstrating robustness to threshold choices and unknown numbers of novel classes. The approach highlights the practical impact of using foundation-model representations for cross-domain open-world discovery and offers a scalable, flexible framework for simultaneous seen-class recognition and novel-class discovery in real-world, domain-shifted environments.
Abstract
In many real-world applications, test data may commonly exhibit categorical shifts, characterized by the emergence of novel classes, as well as distribution shifts arising from feature distributions different from the ones the model was trained on. However, existing methods either discover novel classes in the open-world setting or assume domain shifts without the ability to discover novel classes. In this work, we consider a cross-domain open-world discovery setting, where the goal is to assign samples to seen classes and discover unseen classes under a domain shift. To address this challenging problem, we present CROW, a prototype-based approach that introduces a cluster-then-match strategy enabled by a well-structured representation space of foundation models. In this way, CROW discovers novel classes by robustly matching clusters with previously seen classes, followed by fine-tuning the representation space using an objective designed for cross-domain open-world discovery. Extensive experimental results on image classification benchmark datasets demonstrate that CROW outperforms alternative baselines, achieving an 8% average performance improvement across 75 experimental settings.
