Anti-causal domain generalization: Leveraging unlabeled data
Sorawit Saengkyongam, Juan L. Gamella, Andrew C. Miller, Jonas Peters, Nicolai Meinshausen, Christina Heinze-Deml
TL;DR
This work tackles domain generalization under an anti-causal regime where the outcome $Y$ drives the observed covariates $X$, allowing unlabeled data from multiple environments to inform how distributions shift. It introduces two regularizers, Mean-based Invariant Regularization (MIR) and Variance-based Invariant Regularization (VIR), which penalize sensitivity to mean and covariance shifts estimated from unlabeled data, establishing worst-case robustness guarantees for linear predictors and extending naturally to nonlinear representations. The authors provide population and plug-in estimators, prove consistency, and demonstrate empirical gains on a controlled Light Tunnel and the VitalDB stroke-volume dataset, especially when labeled environments are scarce. These methods enable robust performance under distribution shifts without requiring outcome labels across many environments, with practical impact in safety-critical settings like healthcare and physics-inspired sensing. The framework supports extensions to alternative losses and nonlinear models via representation learning, broadening applicability to high-dimensional, unstructured data.
Abstract
The problem of domain generalization concerns learning predictive models that are robust to distribution shifts when deployed in new, previously unseen environments. Existing methods typically require labeled data from multiple training environments, limiting their applicability when labeled data are scarce. In this work, we study domain generalization in an anti-causal setting, where the outcome causes the observed covariates. Under this structure, environment perturbations that affect the covariates do not propagate to the outcome, which motivates regularizing the model's sensitivity to these perturbations. Crucially, estimating these perturbation directions does not require labels, enabling us to leverage unlabeled data from multiple environments. We propose two methods that penalize the model's sensitivity to variations in the mean and covariance of the covariates across environments, respectively, and prove that these methods have worst-case optimality guarantees under certain classes of environments. Finally, we demonstrate the empirical performance of our approach on a controlled physical system and a physiological signal dataset.
