UDA-Bench: Revisiting Common Assumptions in Unsupervised Domain Adaptation Using a Standardized Framework
Tarun Kalluri, Sreyas Ravichandran, Manmohan Chandraker
TL;DR
The paper introduces UDA-Bench, a standardized PyTorch framework for fair, cross-method evaluation of unsupervised domain adaptation (UDA). Using it, a large-scale empirical study across diverse datasets examines three factors: backbone architecture, unlabeled data volume, and pre-training data. The study finds that (i) newer vision-transformer backbones improve cross-domain robustness but often reduce the relative benefit of UDA methods, (ii) current methods underutilize unlabeled target data, so adding more of it yields diminishing returns, and (iii) pre-training data significantly shapes downstream adaptation in both supervised and self-supervised settings, with in-task pre-training yielding substantial improvements. These results challenge some conventional beliefs about unlabeled data efficiency and underscore the need for standardized benchmarks. The findings offer practical guidance for researchers and practitioners, and the framework is released as an open-source resource for future UDA research.
Abstract
In this work, we take a deeper look into the diverse factors that influence the efficacy of modern unsupervised domain adaptation (UDA) methods using a large-scale, controlled empirical study. To facilitate our analysis, we first develop UDA-Bench, a novel PyTorch framework that standardizes training and evaluation for domain adaptation, enabling fair comparisons across several UDA methods. Using UDA-Bench, our comprehensive empirical study of the impact of backbone architectures, unlabeled data quantity, and pre-training datasets reveals that: (i) the benefits of adaptation methods diminish with advanced backbones, (ii) current methods underutilize unlabeled data, and (iii) pre-training data significantly affects downstream adaptation in both supervised and self-supervised settings. In the context of unsupervised adaptation, these observations uncover several novel and surprising properties, while scientifically validating several others that were often considered empirical heuristics or practitioner intuitions in the absence of a standardized training and evaluation framework. The UDA-Bench framework and trained models are publicly available at https://github.com/ViLab-UCSD/UDABench_ECCV2024.
