Revisiting Spurious Correlation in Domain Generalization

Bin Qin; Jiangmeng Li; Yi Li; Xuesong Wu; Yupeng Wang; Wenwen Qiang; Jianwen Cao

Revisiting Spurious Correlation in Domain Generalization

Bin Qin, Jiangmeng Li, Yi Li, Xuesong Wu, Yupeng Wang, Wenwen Qiang, Jianwen Cao

TL;DR

This work reframes domain generalization through a representation-learning Structural Causal Model (SCM) to analyze spurious correlations that cause OOD failure. It distinguishes fork-specific and collider-specific spurious correlations and argues that collider-specific spurious paths are the correct OOD-oriented mechanism, guiding debiasing via backdoor adjustment. The authors introduce a Propensity Score Weighted (PSW) estimator, augmented with FFT-based feature separation and a pairing scheme to robustly estimate propensity scores and reweight observations, enabling plug-in integration with existing OOD methods. Empirical results across six benchmarks show consistent generalization gains and SOTA performance on several datasets, demonstrating the practical utility and versatility of the proposed PSIW framework for robust OOD generalization.

Abstract

Without loss of generality, existing machine learning techniques may learn spurious correlation dependent on the domain, which exacerbates the generalization of models in out-of-distribution (OOD) scenarios. To address this issue, recent works build a structural causal model (SCM) to describe the causality within data generation process, thereby motivating methods to avoid the learning of spurious correlation by models. However, from the machine learning viewpoint, such a theoretical analysis omits the nuanced difference between the data generation process and representation learning process, resulting in that the causal analysis based on the former cannot well adapt to the latter. To this end, we explore to build a SCM for representation learning process and further conduct a thorough analysis of the mechanisms underlying spurious correlation. We underscore that adjusting erroneous covariates introduces bias, thus necessitating the correct selection of spurious correlation mechanisms based on practical application scenarios. In this regard, we substantiate the correctness of the proposed SCM and further propose to control confounding bias in OOD generalization by introducing a propensity score weighted estimator, which can be integrated into any existing OOD method as a plug-and-play module. The empirical results comprehensively demonstrate the effectiveness of our method on synthetic and large-scale real OOD datasets.

Revisiting Spurious Correlation in Domain Generalization

TL;DR

Abstract

Paper Structure (32 sections, 3 theorems, 28 equations, 8 figures, 8 tables)

This paper contains 32 sections, 3 theorems, 28 equations, 8 figures, 8 tables.

Introduction
Spurious Correlation on OOD Generalization
Problem Setting
Learning Representation Process
Fork-Specific Spurious Correlation.
Theoretical Analysis of Fork-Specific Spurious Correlation.
Causal graph with spurious correlation on OOD Generalization
Collider-Specific Spurious Correlation.
Theoretical Analysis of Collider-Specific Spurious Correlation.
Controlling Confounding Bias on OOD Generalization
Propensity-Scored ERM for OOD Generalization
Calculation of Propensity Scores
Separate Invariant Features and Spurious Features via FFT.
Pairing Scheme for Improving Propensity Score Estimation.
Learning Objectives.
...and 17 more sections

Key Result

Proposition 1

(Latent common cause Verma1993) In causal model $M$, the spurious correlation between variables $S$ and $Y$: $S - Y$ represents a latent common cause $S\leftarrow L\rightarrow Y$.

Figures (8)

Figure 1: The SCM graph of representation learning for OOD. In Figure (a), the dash between $S$ and $Y$ represents the complex spurious correlation, where the direction of causal arrows cannot be determined. Figure (b) illustrates fork-specific spurious correlation, replacing $S - Y$ with $S\leftarrow L\rightarrow Y$, which is widely accepted as the latent common cause proposition in causal inference.
Figure 2: On ColoredMNIST dataset, when the distribution of the training and test sets is consistent (i.e., no bias), ERM achieves an accuracy rate of over 95%, even the labels are associated with colors to varying degrees. However, when the dataset is biased, associating labels with colors leads to a sharp decline in the accuracy of ERM.
Figure 3: The SCM graph of collider-specific spurious correlation model, with grey nodes indicating conditioning on that variable. Figure (b) is the extracted confounding path that introduces bias in estimating $C \rightarrow Y$.
Figure 4: The results of hyper-parameters.
Figure 5: The results of case study on a batch of randomly selected samples.
...and 3 more figures

Theorems & Definitions (12)

Definition 1
Proposition 1
Definition 2
Theorem 1
Definition 3
Definition 4
Corollary 1
Definition 5
Definition 6
Definition 7
...and 2 more

Revisiting Spurious Correlation in Domain Generalization

TL;DR

Abstract

Revisiting Spurious Correlation in Domain Generalization

Authors

TL;DR

Abstract

Table of Contents

Key Result

Figures (8)

Theorems & Definitions (12)