Open-Set Domain Adaptation Under Background Distribution Shift: Challenges and A Provably Efficient Solution
Shravan Chaudhari, Yoav Wald, Suchi Saria
TL;DR
This work formalizes Open-Set Domain Adaptation under background distribution shift and introduces CoLOR, a scalable constrained-learning solution with finite-sample guarantees. It provides a theoretical and empirical comparison against domain-discriminator methods, showing CoLOR's superior novelty detection and overall OSDA robustness, especially when the novel class is rare. The approach leverages multiple novelty heads and a shared representation to jointly optimize known-class accuracy and novel-class recall, validated across image and text benchmarks with varying α. The results offer practical guidance for deploying robust OSDA systems in real-world, shift-prone environments and point to avenues for integrating stronger theoretical guarantees with large-scale foundation-model features.
Abstract
As we deploy machine learning systems in the real world, a core challenge is to maintain a model that is performant even as the data shifts. Such shifts can take many forms: new classes may emerge that were absent during training, a problem known as open-set recognition, and the distribution of known categories may change. Guarantees on open-set recognition are mostly derived under the assumption that the distribution of known classes, which we call the background distribution, is fixed. In this paper we develop CoLOR, a method that is guaranteed to solve open-set recognition even in the challenging case where the background distribution shifts. We prove that the method works under benign assumptions that the novel class is separable from the non-novel classes, and provide theoretical guarantees that it outperforms a representative baseline in a simplified overparameterized setting. We develop techniques to make CoLOR scalable and robust, and perform comprehensive empirical evaluations on image and text data. The results show that CoLOR significantly outperforms existing open-set recognition methods under background shift. Moreover, we provide new insights into how factors such as the size of the novel class influences performance, an aspect that has not been extensively explored in prior work.
