Domain Generalization via Optimal Transport with Metric Similarity Learning
Fan Zhou, Zhuqing Jiang, Changjian Shui, Boyu Wang, Brahim Chaib-draa
TL;DR
The paper tackles domain generalization by learning invariant features across multiple source domains to generalize to unseen targets. It introduces Wasserstein Adversarial Domain Generalization (WADG), which combines optimal transport-based feature alignment using the Wasserstein distance $W_1$ with a metric-learning objective to enforce domain-agnostic, discriminative boundaries. OT keeps class-conditional features cohesive across domains, while the metric loss promotes separability between classes, yielding a clearer classification boundary. Experiments on VLCS, PACS, and Office-Home show improvements over most baselines, with ablations validating the contributions of both the OT alignment and the metric-learning component.
Abstract
Generalizing knowledge to unseen domains, where data and labels are unavailable, is crucial for machine learning models. We tackle the domain generalization problem of learning from multiple source domains and generalizing to a target domain with unknown statistics. The key idea is to extract the underlying invariant features across all the domains. Previous domain generalization approaches mainly focused on learning invariant features and stacking the learned features from each source domain to generalize to a new target domain while ignoring the label information, which leads to indistinguishable features with an ambiguous classification boundary. One possible solution is to constrain the label similarity when extracting the invariant features and to exploit the label similarities for class-specific cohesion and separation of features across domains. We therefore adopt optimal transport with the Wasserstein distance, which can constrain the class-label similarity, for adversarial training, and further deploy a metric learning objective that leverages the label information to achieve a distinguishable classification boundary. Empirical results show that our proposed method outperforms most of the baselines. Ablation studies further demonstrate the effectiveness of each component of our method.
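
To make the two ingredients named above concrete, the following is a minimal sketch and not the paper's implementation. It assumes one-dimensional features and equal-size samples, for which the empirical $W_1$ has a closed form (the mean absolute difference of sorted values); the adversarial training described in the abstract instead operates on learned high-dimensional features. The pairwise contrastive loss is a stand-in for a generic metric learning objective, and the function names `wasserstein_1d` and `contrastive_loss` and the `margin` parameter are hypothetical.

```python
import math


def wasserstein_1d(xs, ys):
    """Empirical W_1 between two equal-size 1-D samples.

    Assumption: 1-D features and equal sample sizes. In that case the
    optimal coupling matches sorted values, so W_1 is the mean absolute
    difference between order statistics.
    """
    if len(xs) != len(ys) or not xs:
        raise ValueError("samples must be non-empty and of equal size")
    return sum(abs(a - b) for a, b in zip(sorted(xs), sorted(ys))) / len(xs)


def contrastive_loss(f1, f2, same_class, margin=1.0):
    """Illustrative pairwise contrastive loss (the paper's loss may differ).

    Same-class pairs are pulled together (cohesion); different-class
    pairs are pushed at least `margin` apart (separation).
    """
    d = math.dist(f1, f2)  # Euclidean distance between feature vectors
    if same_class:
        return d ** 2
    return max(0.0, margin - d) ** 2


# Features of one class observed in two hypothetical source domains.
domain_a = [0.0, 1.0, 2.0]
domain_b = [0.5, 1.5, 2.5]
shift = wasserstein_1d(domain_a, domain_b)

# A same-class pair is penalized for being far apart; a different-class
# pair already beyond the margin incurs no penalty.
pull = contrastive_loss([0.0, 0.0], [3.0, 4.0], same_class=True)
push = contrastive_loss([0.0, 0.0], [3.0, 4.0], same_class=False)
```

In this toy setting, a smaller `shift` corresponds to better-aligned class-conditional features across domains, while the contrastive term only penalizes different-class pairs that fall inside the margin.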
