Incorporating priors in learning: a random matrix study under a teacher-student framework
Malik Tiomoko, Ekkehard Schnoor
TL;DR
The paper tackles how informative Gaussian priors affect generalization in high-dimensional MAP regression under proportional asymptotics. It develops exact asymptotic risk formulas via random matrix theory, revealing a bias–variance–prior tradeoff, explaining double descent, and quantifying prior mismatch. It provides a closed-form minimizer for the test risk in the identity covariance case, and extends to general covariance with a numerically computable optimal regularization. The results offer theoretical clarity and practical guidance for leveraging domain knowledge in high-dimensional learning, with implications for transfer learning and time-series forecasting. Overall, the work bridges Bayesian priors, classical regularization, and modern asymptotics, delivering actionable insights and robust estimators for regularization parameters.
Abstract
Regularized linear regression is central to machine learning, yet its high-dimensional behavior with informative priors remains poorly understood. We provide the first exact asymptotic characterization of training and test risks for maximum a posteriori (MAP) regression with Gaussian priors centered at a domain-informed initialization. Our framework unifies ridge regression, least squares, and prior-informed estimators, and -- using random matrix theory -- yields closed-form risk formulas that expose the bias-variance-prior tradeoff, explain double descent, and quantify prior mismatch. We also identify a closed-form minimizer of test risk, enabling a simple estimator of the optimal regularization parameter. Simulations confirm the theory with high accuracy. By connecting Bayesian priors, classical regularization, and modern asymptotics, our results provide both conceptual clarity and practical guidance for learning with structured prior knowledge.
