COD: Learning Conditional Invariant Representation for Domain Adaptation Regression
Hao-Ran Yang, Chuan-Xian Ren, You-Wei Luo
TL;DR
The paper tackles domain adaptation for regression with continuous labels by establishing a sufficiency theory: the cross-domain conditional discrepancy for $P_{X|Y}$ governs the generalization error. It introduces COD, a kernel-based metric that captures differences between conditional distributions via mean embeddings and covariance operators, and proves a zero COD implies identical conditional laws. Building on this, it proposes a COD-based representation learning framework with a discriminability-enhanced COD_mod term and a Kernel Gaussian Wasserstein (KGW) discrepancy, optimized together with a source regression loss $\mathcal{L}_\mathrm{src}$. Empirical results on dSprites, Biwi Kinect, and MPI3D show COD yields state-of-the-art performance and robust gains from conditional alignment, validating both the theory and the method's practical impact for real-world DAR tasks.
Abstract
Aiming to generalize the label knowledge from a source domain with continuous outputs to an unlabeled target domain, Domain Adaptation Regression (DAR) is developed for complex practical learning problems. However, due to the continuity problem in regression, existing conditional distribution alignment theory and methods with discrete prior, which are proven to be effective in classification settings, are no longer applicable. In this work, focusing on the feasibility problems in DAR, we establish the sufficiency theory for the regression model, which shows the generalization error can be sufficiently dominated by the cross-domain conditional discrepancy. Further, to characterize conditional discrepancy with continuous conditioning variable, a novel Conditional Operator Discrepancy (COD) is proposed, which admits the metric property on conditional distributions via the kernel embedding theory. Finally, to minimize the discrepancy, a COD-based conditional invariant representation learning model is proposed, and the reformulation is derived to show that reasonable modifications on moment statistics can further improve the discriminability of the adaptation model. Extensive experiments on standard DAR datasets verify the validity of theoretical results and the superiority over SOTA DAR methods.
