Unified Source-Free Domain Adaptation
Song Tang, Wenxin Su, Mao Ye, Boyu Wang, Xiatian Zhu
TL;DR
This work tackles the practical challenge of Source-Free Domain Adaptation (SFDA) across Closed-set, Open-set, Partial-set, and Generalized settings by proposing Unified SFDA. It introduces CausalDA, a causality-guided method that discovers latent external and internal causal factors in the logit space, leveraging a frozen Vision-Language model (CLIP) and a self-supervised information bottleneck to achieve robust cross-domain adaptation without target-domain supervision. The approach yields state-of-the-art results across diverse SFDA benchmarks and demonstrates strong generalization to source-free OOD scenarios, supported by extensive ablations and analyses. The framework promises improved deployability in privacy-sensitive or data-restricted environments by relying on causal factors rather than solely statistical associations.
Abstract
In the pursuit of transferring a source model to a target domain without access to the source training data, Source-Free Domain Adaptation (SFDA) has been extensively explored across various scenarios, including Closed-set, Open-set, Partial-set, and Generalized settings. Existing methods, focusing on specific scenarios, not only address a limited subset of challenges but also necessitate prior knowledge of the target domain, significantly limiting their practical utility and deployability. In light of these considerations, we introduce a more practical yet challenging problem, termed unified SFDA, which comprehensively incorporates all specific scenarios in a unified manner. In this paper, we propose a novel approach latent Causal factors discovery for unified SFDA (CausalDA). In contrast to previous alternatives that emphasize learning the statistical description of reality, we formulate CausalDA from a causality perspective. The objective is to uncover potential causality between latent variables and model decisions, enhancing the reliability and robustness of the learned model against domain shifts. To integrate extensive world knowledge, we leverage a pre-trained vision-language model such as CLIP. This aids in the formation and discovery of latent causal factors in the absence of supervision in the variation of distribution and semantics, coupled with a newly designed information bottleneck with theoretical guarantees. Extensive experiments demonstrate that CausalDA can achieve new state-of-the-art results in distinct SFDA settings, as well as source-free out-of-distribution generalization. Our code and data are available at https://github.com/tntek/CausalDA.
