An Investigation of Why Overparameterization Exacerbates Spurious Correlations
Shiori Sagawa, Aditi Raghunathan, Pang Wei Koh, Percy Liang
TL;DR
This paper investigates why overparameterization can worsen worst-group generalization when data contain spurious correlations, despite improving average test error. It combines empirical studies on CelebA and Waterbirds with synthetic simulations to identify two key data properties that modulate this effect: the majority/minority group proportion $p_\mathsf{maj}$ and the spurious-to-core information ratio $r_\mathsf{s:c}$. A theoretical analysis in an explicit-memorization linear setting shows that the minimum-norm inductive bias in overparameterized regimes can lead models to memorize minority points via spurious features, increasing worst-group error; underparameterized models that rely on core features tend to generalize better across groups. The paper also demonstrates that subsampling the majority group can counterintuitively reduce worst-group error in the overparameterized regime, sometimes matching or beating reweighted underparameterized baselines, indicating a fundamental trade-off between average accuracy and worst-group robustness. Together, these results highlight a tension between the benefits of overparameterization for average performance and the need to preserve robust performance on underrepresented groups.
Abstract
We study why overparameterization -- increasing model size well beyond the point of zero training error -- can hurt test error on minority groups despite improving average test error when there are spurious correlations in the data. Through simulations and experiments on two image datasets, we identify two key properties of the training data that drive this behavior: the proportions of majority versus minority groups, and the signal-to-noise ratio of the spurious correlations. We then analyze a linear setting and theoretically show how the inductive bias of models towards "memorizing" fewer examples can cause overparameterization to hurt. Our analysis leads to a counterintuitive approach of subsampling the majority group, which empirically achieves low minority error in the overparameterized regime, even though the standard approach of upweighting the minority fails. Overall, our results suggest a tension between using overparameterized models versus using all the training data for achieving low worst-group error.
