Spurious Correlations in High Dimensional Regression: The Roles of Regularization, Simplicity Bias and Over-Parameterization
Simone Bombari, Marco Mondelli
TL;DR
This work provides a precise statistical account of spurious correlations in high-dimensional regression, modeling inputs as a core feature $x$ and a spurious feature $y$, and quantifying the learned spurious signal via $\mathcal{C}$. For linear regression with ridge regularization, the authors derive a non-asymptotic, deterministic target $\mathcal{C}^{\Sigma}(\lambda)$ that $\mathcal{C}(\hat{\theta}_{LR}(\lambda))$ concentrates to, with the target expressed through the data covariance $\Sigma$ and its Schur complement relative to $x$. They show a trade-off between in-distribution test loss $\mathcal{L}$ and spurious correlations: the regularization strength $\lambda$ that minimizes $\mathcal{L}$ lies in an interval where $\mathcal{C}^{\Sigma}(\lambda)$ is increasing, implying that some spurious signal can be beneficial in-distribution. Extending to over-parameterized models, they prove an exact random-features equivalence to regularized LR with an effective $\tilde{\lambda}$ determined by the activation’s Hermite coefficients, which preserves spurious correlations in the RF setting. Together with experiments on Gaussian data, Color-MNIST, and CIFAR-10, the results illuminate how regularization, simplicity bias, and over-parameterization shape the reliance on spurious features and their impact on robustness and generalization.
Abstract
Learning models have been shown to rely on spurious correlations between non-predictive features and the associated labels in the training data, with negative implications on robustness, bias and fairness. In this work, we provide a statistical characterization of this phenomenon for high-dimensional regression, when the data contains a predictive core feature $x$ and a spurious feature $y$. Specifically, we quantify the amount of spurious correlations $C$ learned via linear regression, in terms of the data covariance and the strength $λ$ of the ridge regularization. As a consequence, we first capture the simplicity of $y$ through the spectrum of its covariance, and its correlation with $x$ through the Schur complement of the full data covariance. Next, we prove a trade-off between $C$ and the in-distribution test loss $L$, by showing that the value of $λ$ that minimizes $L$ lies in an interval where $C$ is increasing. Finally, we investigate the effects of over-parameterization via the random features model, by showing its equivalence to regularized linear regression. Our theoretical results are supported by numerical experiments on Gaussian, Color-MNIST, and CIFAR-10 datasets.
