Table of Contents
Fetching ...

Two stages domain invariant representation learners solve the large co-variate shift in unsupervised domain adaptation with two dimensional data domains

Hisashi Oshima, Tsuyoshi Ishizone, Tomoyuki Higuchi

TL;DR

The paper tackles unsupervised domain adaptation under large two-dimensional covariate shifts by introducing a two-stage, end-to-end domain invariant representation learning framework that leverages an unsupervised intermediate domain to bridge source, intermediate, and target distributions. A key feature is the L_propose loss, decomposing domain alignment across two adjacent domain pairs, which improves convergence and discriminability on the target. The authors also derive an RV-based theorem for unsupervised hyperparameter tuning, enabling parameter selection without target labels. Empirical results across four datasets and 38 tasks show that the proposed method consistently outperforms baselines (Step-by-step and Normal) and adheres to an upper-bound ordering, especially under larger shifts, with both quantitative gains and qualitative evidence of robust domain invariance and target discriminability. The approach is framed as broadly applicable, with potential extensions including synthetic intermediate domains and integration with other UDA techniques, making it practically impactful for real-world transfer learning scenarios.

Abstract

Recent developments in the unsupervised domain adaptation (UDA) enable the unsupervised machine learning (ML) prediction for target data, thus this will accelerate real world applications with ML models such as image recognition tasks in self-driving. Researchers have reported the UDA techniques are not working well under large co-variate shift problems where e.g. supervised source data consists of handwritten digits data in monotone color and unsupervised target data colored digits data from the street view. Thus there is a need for a method to resolve co-variate shift and transfer source labelling rules under this dynamics. We perform two stages domain invariant representation learning to bridge the gap between source and target with semantic intermediate data (unsupervised). The proposed method can learn domain invariant features simultaneously between source and intermediate also intermediate and target. Finally this achieves good domain invariant representation between source and target plus task discriminability owing to source labels. This induction for the gradient descent search greatly eases learning convergence in terms of classification performance for target data even when large co-variate shift. We also derive a theorem for measuring the gap between trained models and unsupervised target labelling rules, which is necessary for the free parameters optimization. Finally we demonstrate that proposing method is superiority to previous UDA methods using 4 representative ML classification datasets including 38 UDA tasks. Our experiment will be a basis for challenging UDA problems with large co-variate shift.

Two stages domain invariant representation learners solve the large co-variate shift in unsupervised domain adaptation with two dimensional data domains

TL;DR

The paper tackles unsupervised domain adaptation under large two-dimensional covariate shifts by introducing a two-stage, end-to-end domain invariant representation learning framework that leverages an unsupervised intermediate domain to bridge source, intermediate, and target distributions. A key feature is the L_propose loss, decomposing domain alignment across two adjacent domain pairs, which improves convergence and discriminability on the target. The authors also derive an RV-based theorem for unsupervised hyperparameter tuning, enabling parameter selection without target labels. Empirical results across four datasets and 38 tasks show that the proposed method consistently outperforms baselines (Step-by-step and Normal) and adheres to an upper-bound ordering, especially under larger shifts, with both quantitative gains and qualitative evidence of robust domain invariance and target discriminability. The approach is framed as broadly applicable, with potential extensions including synthetic intermediate domains and integration with other UDA techniques, making it practically impactful for real-world transfer learning scenarios.

Abstract

Recent developments in the unsupervised domain adaptation (UDA) enable the unsupervised machine learning (ML) prediction for target data, thus this will accelerate real world applications with ML models such as image recognition tasks in self-driving. Researchers have reported the UDA techniques are not working well under large co-variate shift problems where e.g. supervised source data consists of handwritten digits data in monotone color and unsupervised target data colored digits data from the street view. Thus there is a need for a method to resolve co-variate shift and transfer source labelling rules under this dynamics. We perform two stages domain invariant representation learning to bridge the gap between source and target with semantic intermediate data (unsupervised). The proposed method can learn domain invariant features simultaneously between source and intermediate also intermediate and target. Finally this achieves good domain invariant representation between source and target plus task discriminability owing to source labels. This induction for the gradient descent search greatly eases learning convergence in terms of classification performance for target data even when large co-variate shift. We also derive a theorem for measuring the gap between trained models and unsupervised target labelling rules, which is necessary for the free parameters optimization. Finally we demonstrate that proposing method is superiority to previous UDA methods using 4 representative ML classification datasets including 38 UDA tasks. Our experiment will be a basis for challenging UDA problems with large co-variate shift.

Paper Structure

This paper contains 21 sections, 1 theorem, 7 equations, 14 figures, 14 tables, 4 algorithms.

Key Result

Theorem 3.1

When executing Algorithm alg1, conditional distribution gap between $\phi(\cdot)$ and $\mathcal{D}_{T'}$'s ground truth is calculated by ($C_1, C_2$ as constant values)

Figures (14)

  • Figure 1: Forward path and backward process of Normal(DANNs), Step-by-step, Ours. The $L_{task}, -L_{domain}$ as $L_c, L_d$ for short.
  • Figure 2: Comparison of the evaluation values between methods at each rotation angle in Dataset A (the left is with DANNs, right with CoRALs). Detailed results are in Appendix \ref{['Quantitative results in detail']} Table \ref{['tab:result_detail_A_danns']} and \ref{['tab:result_detail_A_corals']} including the standard deviations.
  • Figure 3: Quantitative result overview covering 8 methods and 4 datasets.
  • Figure 4: The $L_{task}, L_{domain}(\mathcal{D}_S, \mathcal{D}_{T}), L_{domain}(\mathcal{D}_T, \mathcal{D}_{T'})$ and evaluation per epoch. The $L_{domain}(\mathcal{D}_S, \mathcal{D}_{T})$ as $L_{domain}(S,T)$ for short. Column is one trial for Dataset A (source$\rightarrow$30rotated$\rightarrow$60rotated), C ((d,s3mini)$\rightarrow$(e,s3)), D ((3, s)$\rightarrow$(1, w)) with DANNs.
  • Figure 5: Learned representation at different levels in the Dataset A (source$\rightarrow$30rotated$\rightarrow$60rotated) experiment with DANNs. The first column corresponds to feature representation with domain labels color, second feature representation with task labels and third one is predictive probability for grid space. Rows express methods(Ours, Step-by-step, Normal in order). Representations were gone through t-distributed stochastic neighbor embedding (t-SNE)lg.
  • ...and 9 more figures

Theorems & Definitions (1)

  • Theorem 3.1