CASUAL: Conditional Support Alignment for Domain Adaptation with Label Shift
Anh T Nguyen, Lam Tran, Anh Tong, Tuan-Duy H. Nguyen, Toan Tran
TL;DR
We address unsupervised domain adaptation under label distribution shift, where source and target label distributions differ while conditional feature relations can be misaligned. The authors introduce CASUAL, a Conditional Adversarial SUpport ALignment framework that targets the supports of class-conditional feature representations using a conditional symmetric support divergence (CSSD) based target risk bound. The method integrates source supervision, target entropy minimization, Lipschitz regularization, and a pseudo-label guided joint-support alignment via an adversarial discriminator, leading to a bound-driven training objective. Empirical results across USPS→MNIST, STL→CIFAR, and VisDA-2017 demonstrate robust performance under varying label shift, with ablations confirming the roles of each loss term and visualizations illustrating improved class separation and reduced CSSD. Overall, CASUAL offers a principled, label-shift-robust approach to learning discriminative representations without explicitly estimating the shifted label distribution.
Abstract
Unsupervised domain adaptation (UDA) refers to a domain adaptation framework in which a learning model is trained based on the labeled samples on the source domain and unlabeled ones in the target domain. The dominant existing methods in the field that rely on the classical covariate shift assumption to learn domain-invariant feature representation have yielded suboptimal performance under label distribution shift. In this paper, we propose a novel Conditional Adversarial SUpport ALignment (CASUAL) whose aim is to minimize the conditional symmetric support divergence between the source's and target domain's feature representation distributions, aiming at a more discriminative representation for the classification task. We also introduce a novel theoretical target risk bound, which justifies the merits of aligning the supports of conditional feature distributions compared to the existing marginal support alignment approach in the UDA settings. We then provide a complete training process for learning in which the objective optimization functions are precisely based on the proposed target risk bound. Our empirical results demonstrate that CASUAL outperforms other state-of-the-art methods on different UDA benchmark tasks under different label shift conditions.
