Gaussian Universality of Perceptrons with Random Labels
Federica Gerace, Florent Krzakala, Bruno Loureiro, Ludovic Stephan, Lenka Zdeborová
TL;DR
The paper addresses whether Gaussian-design results for perceptrons with random labels extend to realistic, high-dimensional data. It develops a rigorous universality framework showing that mixtures of Gaussians with random labels are effectively equivalent to a single Gaussian design with matching covariance in the proportional regime, and that this universality holds for generic convex losses as regularization vanishes, with a strong form for ridge regression. The authors derive exact asymptotic expressions via replica-analysis and validate them with extensive numerical experiments on real datasets preprocessed by random features or scattering transforms. The findings illuminate why Gaussian-based theory often captures practical learning behavior and offer a path toward analytically tractable insights for high-dimensional learning on real data, with potential extensions to non-convex losses and deeper networks.
Abstract
While classical in many theoretical settings - and in particular in statistical physics-inspired works - the assumption of Gaussian i.i.d. input data is often perceived as a strong limitation in the context of statistics and machine learning. In this study, we redeem this line of work in the case of generalized linear classification, a.k.a. the perceptron model, with random labels. We argue that there is a large universality class of high-dimensional input data for which we obtain the same minimum training loss as for Gaussian data with corresponding data covariance. In the limit of vanishing regularization, we further demonstrate that the training loss is independent of the data covariance. On the theoretical side, we prove this universality for an arbitrary mixture of homogeneous Gaussian clouds. Empirically, we show that the universality holds also for a broad range of real datasets.
