Should Bias be Eliminated? A General Framework to Use Bias for OOD Generalization

Yan Li; Yunlong Deng; Zijian Li; Anpeng Wu; Zeyu Tang; Kun Zhang; Guangyi Chen

Should Bias be Eliminated? A General Framework to Use Bias for OOD Generalization

Yan Li, Yunlong Deng, Zijian Li, Anpeng Wu, Zeyu Tang, Kun Zhang, Guangyi Chen

TL;DR

The paper addresses OOD generalization by reframing bias as a potential resource rather than a nuisance. It introduces a generative, causal-informed framework that disentangles content ${c}$ from bias ${b}$, then leverages ${b}$ through an environment-routing mechanism and an adaptive label prior to improve predictions under domain and label shifts. Theoretical results establish block-wise identifiability of ${c}$ and ${b}$ and show when bias can contribute to prediction via unblocked causal paths to ${y}$; empirically, BAG outperforms invariance-only baselines and prior bias-utilization methods on synthetic data and standard DG benchmarks. This approach offers a principled way to harness bias for robust, transferable models, with practical implications for real-world deployments where domain and label distributions shift.

Abstract

Most approaches to out-of-distribution (OOD) generalization learn domain-invariant representations by discarding contextual bias. In this paper, we raise a critical question: Should bias be eliminated? If not, is there a general way to leverage bias for better OOD generalization? To answer these questions, we first provide a theoretical analysis that characterizes the circumstances in which biased features contribute positively. Although theoretical results show that bias may sometimes play a positive role, leveraging it effectively is non-trivial, since its harmful and beneficial components are often entangled. Recent advances have sought to refine the prediction of bias by presuming reliable predictions from invariant features. However, such assumptions may be too strong in the real world, especially when the target also shifts from training to testing domains. Motivated by this challenge, we introduce a framework to leverage bias in a more general scenario. Specifically, we employ a generative model to capture the data generation process and identify the underlying bias factors, which are then used to construct a bias-aware predictor. Since the bias-aware predictor may shift across environments, we first estimate the environment state to train predictors under different environments, combining them as a mixture of domain experts for the final prediction. Then, we build a general invariant predictor, which can be invariant under label shift to guide the adaptation of the bias-aware predictor. Evaluations on synthetic data and standard domain generalization benchmarks demonstrate that our method consistently outperforms both invariance only baselines, recent bias utilization approaches and advanced baselines, yielding improved robustness and adaptability.

Should Bias be Eliminated? A General Framework to Use Bias for OOD Generalization

TL;DR

Abstract

Should Bias be Eliminated? A General Framework to Use Bias for OOD Generalization

TL;DR

Abstract

Paper Structure

Table of Contents

Key Result

Figures (5)

Theorems & Definitions (8)