Simulations of Common Unsupervised Domain Adaptation Algorithms for Image Classification
Ahmad Chaddad, Yihang Wu, Yuchen Jiang, Ahmed Bouridane, Christian Desrosiers
TL;DR
This paper investigates unsupervised domain adaptation (UDA) for image classification by simulating and comparing a range of prevalent DA techniques on public datasets. It covers traditional, adversarial, and non-adversarial deep-learning approaches, emphasizing SSRT's strong Office-31 performance (83–92% range in simulations) and its sensitivity to batch size on Office-Home. Key contributions include a systematic cross-method evaluation, guidance on when to prefer certain backbones or training regimes, and insights into data quality, cross-dataset transfer, and medical-domain applicability. The study also discusses real-world challenges such as privacy constraints, the rise of foundation-model-based DA, self-supervised and test-time adaptation, and the need for robust OOD/domain drift handling, with public code available at the provided GitHub repository.
Abstract
Traditional machine learning assumes that training and test sets are derived from the same distribution; however, this assumption does not always hold in practical applications. This distribution disparity can lead to severe performance drops when the trained model is used in new data sets. Domain adaptation (DA) is a machine learning technique that aims to address this problem by reducing the differences between domains. This paper presents simulation-based algorithms of recent DA techniques, mainly related to unsupervised domain adaptation (UDA), where labels are available only in the source domain. Our study compares these techniques with public data sets and diverse characteristics, highlighting their respective strengths and drawbacks. For example, Safe Self-Refinement for Transformer-based DA (SSRT) achieved the highest accuracy (91.6\%) in the office-31 data set during our simulations, however, the accuracy dropped to 72.4\% in the Office-Home data set when using limited batch sizes. In addition to improving the reader's comprehension of recent techniques in DA, our study also highlights challenges and upcoming directions for research in this domain. The codes are available at https://github.com/AIPMLab/Domain_Adaptation.
