Consistency Regularization for Domain Generalization with Logit Attribution Matching
Han Gao, Kaican Li, Weiyan Xie, Zhi Lin, Yongxiang Huang, Luning Wang, Caleb Chen Cao, Nevin L. Zhang
TL;DR
This work addresses domain generalization (DG) under distribution shifts by leveraging semantic sharing (SS) pairs created through data augmentation to regularize predictions. It develops a causal latent decomposition (CLD) theory with causal factors $X^{\mathrm{c}}$ and non-causal factors $X^{\mathrm{n}}$, and proves that causally invariant predictions minimize out-of-distribution loss when target support is contained in source support. The authors introduce Logit Attribution Matching (LAM), a CR method using labeled SS pairs that regularizes the logit contributions of the target class, and demonstrate its superior performance over ERM+DA and other CR/DG baselines on five benchmarks across varied architectures. The work highlights the practical value of labeled SS pairs for robust DG and provides public code and data to support reproducibility.
Abstract
Domain generalization (DG) is about training models that generalize well under domain shift. Previous research on DG has been conducted mostly in single-source or multi-source settings. In this paper, we consider a third, lesser-known setting where a training domain is endowed with a collection of pairs of examples that share the same semantic information. Such semantic sharing (SS) pairs can be created via data augmentation and then utilized for consistency regularization (CR). We present a theory showing CR is conducive to DG and propose a novel CR method called Logit Attribution Matching (LAM). We conduct experiments on five DG benchmarks and four pretrained models with SS pairs created by both generic and targeted data augmentation methods. LAM outperforms representative single/multi-source DG methods and various CR methods that leverage SS pairs. The code and data of this project are available at https://github.com/Gaohan123/LAM
