Risk Phase Transitions in Spiked Regression: Alignment Driven Benign and Catastrophic Overfitting
Jiping Li, Rishi Sonthalia
TL;DR
The paper addresses generalization in high-dimensional linear regression under a spiked covariance model, focusing on minimum-norm interpolants and how spike strength and target-spike alignment shape generalization. It develops an exact risk decomposition into $Bias$, $Variance$, $Data oise$, and $Target ext{ Alignment}$ terms and uses this to classify regimes as benign, tempered, or catastrophic as the dimension parameter grows and spike scales vary. A key finding is that alignment with the spike is not universally beneficial: in well-specified aligned problems, increasing spike strength can drive transitions to catastrophic overfitting before benign overfitting appears, while misspecification and covariate shift can worsen or alter these transitions. The results extend beyond linear models, with nonlinear experiments (e.g., 3-layer ReLU nets) exhibiting similar alignment phase transitions, suggesting broad relevance for generalization in anisotropic data. Overall, the work provides a detailed map of how spectral structure and target alignment govern generalization in overparameterized regimes, challenging naive isotropic intuitions and informing model selection under spectral heterogeneity.
Abstract
This paper analyzes the generalization error of minimum-norm interpolating solutions in linear regression using spiked covariance data models. The paper characterizes how varying spike strengths and target-spike alignments can affect risk, especially in overparameterized settings. The study presents an exact expression for the generalization error, leading to a comprehensive classification of benign, tempered, and catastrophic overfitting regimes based on spike strength, the aspect ratio $c=d/n$ (particularly as $c \to \infty$), and target alignment. Notably, in well-specified aligned problems, increasing spike strength can surprisingly induce catastrophic overfitting before achieving benign overfitting. The paper also reveals that target-spike alignment is not always advantageous, identifying specific, sometimes counterintuitive, conditions for its benefit or detriment. Alignment with the spike being detrimental is empirically demonstrated to persist in nonlinear models.
