Reinforced Domain Selection for Continuous Domain Adaptation

Hanbing Liu; Huaze Tang; Yanru Wu; Yang Li; Xiao-Ping Zhang

Reinforced Domain Selection for Continuous Domain Adaptation

Hanbing Liu, Huaze Tang, Yanru Wu, Yang Li, Xiao-Ping Zhang

TL;DR

Continuous Domain Adaptation (CDA) often suffers from selecting effective intermediate domains without explicit metadata. This work introduces a reinforcement learning framework fused with feature disentanglement to discover transfer paths in an unsupervised manner, guided by a reward based on latent-domain distances. A dual-network architecture isolates domain-invariant and domain-specific features, trained with mutual information-based losses and a supervised objective on the invariant features, while a policy generator learns which intermediate domains to traverse. Empirical results on Rotated MNIST and ADNI show improvements in target accuracy and path efficiency over traditional CDA methods, demonstrating the practicality of dynamic domain-path learning for robust cross-domain adaptation.

Abstract

Continuous Domain Adaptation (CDA) effectively bridges significant domain shifts by progressively adapting from the source domain through intermediate domains to the target domain. However, selecting intermediate domains without explicit metadata remains a substantial challenge that has not been extensively explored in existing studies. To tackle this issue, we propose a novel framework that combines reinforcement learning with feature disentanglement to conduct domain path selection in an unsupervised CDA setting. Our approach introduces an innovative unsupervised reward mechanism that leverages the distances between latent domain embeddings to facilitate the identification of optimal transfer paths. Furthermore, by disentangling features, our method facilitates the calculation of unsupervised rewards using domain-specific features and promotes domain adaptation by aligning domain-invariant features. This integrated strategy is designed to simultaneously optimize transfer paths and target task performance, enhancing the effectiveness of domain adaptation processes. Extensive empirical evaluations on datasets such as Rotated MNIST and ADNI demonstrate substantial improvements in prediction accuracy and domain selection efficiency, establishing our method's superiority over traditional CDA approaches.

Reinforced Domain Selection for Continuous Domain Adaptation

TL;DR

Abstract

Reinforced Domain Selection for Continuous Domain Adaptation

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (3)