Mitigating Spurious Correlations via Disagreement Probability

Hyeonggeun Han; Sehwan Kim; Hyungjun Joo; Sangwoo Hong; Jungwoo Lee

Mitigating Spurious Correlations via Disagreement Probability

Hyeonggeun Han, Sehwan Kim, Hyungjun Joo, Sangwoo Hong, Jungwoo Lee

TL;DR

This work tackles the problem of spurious correlations in supervised learning when bias labels are unavailable. It introduces a bias-label-free objective and DPR, a disagreement-probability–based resampling method that upweights bias-conflicting samples using a deliberately biased model as a group proxy. The authors provide theoretical bounds showing that DPR reduces loss disparity between bias-aligned and bias-conflicting groups while lowering the average loss, and demonstrate state-of-the-art performance across six benchmarks, including challenging real-world datasets. The approach relies on a two-stage training process and calibration of a biased model, yet delivers practical gains in robustness and generalization to unseen data with minimal bias-label requirements.

Abstract

Models trained with empirical risk minimization (ERM) are prone to be biased towards spurious correlations between target labels and bias attributes, which leads to poor performance on data groups lacking spurious correlations. It is particularly challenging to address this problem when access to bias labels is not permitted. To mitigate the effect of spurious correlations without bias labels, we first introduce a novel training objective designed to robustly enhance model performance across all data samples, irrespective of the presence of spurious correlations. From this objective, we then derive a debiasing method, Disagreement Probability based Resampling for debiasing (DPR), which does not require bias labels. DPR leverages the disagreement between the target label and the prediction of a biased model to identify bias-conflicting samples-those without spurious correlations-and upsamples them according to the disagreement probability. Empirical evaluations on multiple benchmarks demonstrate that DPR achieves state-of-the-art performance over existing baselines that do not use bias labels. Furthermore, we provide a theoretical analysis that details how DPR reduces dependency on spurious correlations.

Mitigating Spurious Correlations via Disagreement Probability

TL;DR

Abstract

Mitigating Spurious Correlations via Disagreement Probability

TL;DR

Abstract

Paper Structure

Table of Contents

Key Result

Figures (8)

Theorems & Definitions (4)