Contrastive Conditional Alignment based on Label Shift Calibration for Imbalanced Domain Adaptation
Xiaona Sun, Zhenyu Wu, Zhiqiang Zhan, Yang Ji
TL;DR
This work tackles imbalanced domain adaptation by addressing coexisting covariate and label shifts. It introduces CCA-LSC, a two-stage framework that first applies Contrastive Conditional Alignment to learn domain-invariant and class-discriminative features, then uses Label Shift Calibration to adjust target pseudo-labels according to per-class label shift via the metric $M_{ls}$. The method combines domain adversarial learning, weighted centroid alignment, and discriminative feature alignment with calibrated pseudo-labels to improve pseudo-label reliability and reduce misalignment, achieving state-of-the-art results on OfficeHome and DomainNet. Extensive ablations and analysis demonstrate the effectiveness of each component and the two-stage learning strategy, with practical implications for robust, real-world IDA where label distributions differ across domains.
Abstract
Many existing unsupervised domain adaptation (UDA) methods primarily focus on covariate shift, limiting their effectiveness in imbalanced domain adaptation (IDA) where both covariate shift and label shift coexist. Recent IDA methods have achieved promising results based on self-training using target pseudo labels. However, under the IDA scenarios, the classifier learned in the source domain will exhibit different decision bias from the target domain. It will potentially make target pseudo labels unreliable, and will further lead to error accumulation with incorrect class alignment. Thus, we propose contrastive conditional alignment based on label shift calibration (CCA-LSC) for IDA, to address both covariate shift and label shift. Initially, our contrastive conditional alignment resolve covariate shift to learn representations with domain invariance and class discriminability, which include domain adversarial learning, sample-weighted moving average centroid alignment and discriminative feature alignment. Subsequently, we estimate the probability distribution of the target domain, and calibrate target sample classification predictions based on label shift metrics to encourage labeling pseudo-labels more consistently with the distribution of real target data. Extensive experiments are conducted and demonstrate that our method outperforms existing UDA and IDA methods on benchmarks with both label shift and covariate shift. Our code is available at https://github.com/ysxcj-hub/CCA-LSC.
