The Eigenlearning Framework: A Conservation Law Perspective on Kernel Regression and Wide Neural Networks
James B. Simon, Madeline Dickens, Dhruva Karkada, Michael R. DeWeese
TL;DR
The paper introduces an eigenlearning framework for kernel ridge regression built on a sharp learnability conservation law, showing that the total learnability across a complete basis is bounded by the number of training samples when ridge is zero. By deriving closed-form, modewise estimators—most notably the eigenlearning equations with $\mathcal{L}_i=\lambda_i/(\lambda_i+\kappa)$ and the self-consistent condition $n=\sum_i \lambda_i/(\lambda_i+\kappa)+\delta/\kappa$—the authors express test risk, bias, variance, and related quantities purely in terms of eigenmode learnabilities. The framework yields insights into phenomena such as the deep bootstrap, parity problem hardness for rotation-invariant kernels, and a mean-squared-gradient measure relevant to adversarial robustness, while revealing a deep connection to the free Fermi gas via an explicit expression for $\kappa$ using elementary symmetric polynomials. The approach simplifies prior methods, provides sharp equalities at finite samples, and offers a versatile tool for analyzing generalization and robustness in kernel methods and wide neural networks. Overall, the work delivers interpretable, quantitative predictions across synthetic and real domains and paves the way for applying eigenmode learnability to broader kernel and neural-network contexts.
Abstract
We derive simple closed-form estimates for the test risk and other generalization metrics of kernel ridge regression (KRR). Relative to prior work, our derivations are greatly simplified and our final expressions are more readily interpreted. These improvements are enabled by our identification of a sharp conservation law which limits the ability of KRR to learn any orthonormal basis of functions. Test risk and other objects of interest are expressed transparently in terms of our conserved quantity evaluated in the kernel eigenbasis. We use our improved framework to: i) provide a theoretical explanation for the "deep bootstrap" of Nakkiran et al (2020), ii) generalize a previous result regarding the hardness of the classic parity problem, iii) fashion a theoretical tool for the study of adversarial robustness, and iv) draw a tight analogy between KRR and a well-studied system in statistical physics.
