Cross-Domain Feature Augmentation for Domain Generalization
Yingnan Liu, Yingtian Zou, Rui Qiao, Fusheng Liu, Mong Li Lee, Wynne Hsu
TL;DR
The paper tackles domain generalization by shifting augmentation from the input space to the feature space, introducing a semantic feature decomposition into four components $Z_{c,d}$, $Z_{c,\\neg d}$, $Z_{\\neg c,d}$, $Z_{\\neg c,\\neg d}$ and a cross-domain augmentation scheme named $\text{XDomainMix}$. It employs a two-phase training procedure with a warm-up on original data followed by augmentation-aware learning that optimizes predictions on both $Z$ and the augmented $\tilde{Z}$ via a combined loss $\mathcal{L}_{aug}$, while a domain classifier is trained only on original features. Empirical results across multiple domain-generalization benchmarks demonstrate state-of-the-art performance, improved invariance at both representation and prediction levels, and greater augmentation diversity as measured by MMD and covariance-based metrics. The work highlights the practical value of semantically structured feature augmentation for robust cross-domain generalization, while acknowledging limitations and offering avenues for theoretical grounding and integration with sharpness-aware optimization techniques.
Abstract
Domain generalization aims to develop models that are robust to distribution shifts. Existing methods focus on learning invariance across domains to enhance model robustness, and data augmentation has been widely used to learn invariant predictors, with most methods performing augmentation in the input space. However, augmentation in the input space has limited diversity whereas in the feature space is more versatile and has shown promising results. Nonetheless, feature semantics is seldom considered and existing feature augmentation methods suffer from a limited variety of augmented features. We decompose features into class-generic, class-specific, domain-generic, and domain-specific components. We propose a cross-domain feature augmentation method named XDomainMix that enables us to increase sample diversity while emphasizing the learning of invariant representations to achieve domain generalization. Experiments on widely used benchmark datasets demonstrate that our proposed method is able to achieve state-of-the-art performance. Quantitative analysis indicates that our feature augmentation approach facilitates the learning of effective models that are invariant across different domains.
