Class-Conditional Distribution Balancing for Group Robust Classification
Miaoyun Zhao, Chenrong Li, Qiang Zhang
TL;DR
This work reframes spurious correlations as imbalances in class-conditional distributions and introduces Class-Conditional Distribution Balancing (CCDB), a bias-annotation-free method that maximizes the uncertainty of the label given spurious factors. Through a three-stage, class-aware sample-reweighting pipeline, CCDB balances class-conditional statistics via $W_2$ distances and per-class weights, effectively diminishing spurious cues. Extensive experiments on synthetic multi-class tasks and real-world benchmarks show CCDB achieving state-of-the-art or competitive performance without bias supervision, often rivaling bias-guided approaches. The approach leverages causal-inference notions of covariate balance and demonstrates practical utility across diverse domains, with detailed ablations and analysis of learned sample weights.
Abstract
Spurious correlations that lead models to correct predictions for the wrong reasons pose a critical challenge for robust real-world generalization. Existing research attributes this issue to group imbalance and addresses it by maximizing group-balanced or worst-group accuracy, which heavily relies on expensive bias annotations. A compromise approach involves predicting bias information using extensively pretrained foundation models, which requires large-scale data and becomes impractical for resource-limited rare domains. To address these challenges, we offer a novel perspective by reframing the spurious correlations as imbalances or mismatches in class-conditional distributions, and propose a simple yet effective robust learning method that eliminates the need for both bias annotations and predictions. With the goal of maximizing the conditional entropy (uncertainty) of the label given spurious factors, our method leverages a sample reweighting strategy to achieve class-conditional distribution balancing, which automatically highlights minority groups and classes, effectively dismantling spurious correlations and producing a debiased data distribution for classification. Extensive experiments and analysis demonstrate that our approach consistently delivers state-of-the-art performance, rivaling methods that rely on bias supervision.
