Variational Gaussian Approximation in Replica Analysis of Parametric Models
Takashi Takahashi
TL;DR
This work extends the replica method to parametric inference with unknown data distributions by introducing a grand canonical replicated system and a variational Gaussian approximation (VGA) that adaptively tunes a quadratic trial Hamiltonian per dataset.The method yields tractable stationarity conditions for the VGA parameters $\bm{m}$, $q$, and $\chi$, decomposes estimator fluctuations into quenched and thermal components, and connects to information criteria such as PCIC/WAIC through a controlled expansion.Applied to linear regression, the approach produces learning curves and generalization predictions that remain accurate on synthetic teacher–student data as well as real-world data (Year Prediction MSD), even when $p_{\rm data}$ is unknown.Overall, the GC-VGA framework enables finite-size, data-distribution-agnostic analysis of parametric models and offers a path toward analyzing more complex models while preserving links to classical statistical criteria.
Abstract
We revisit the replica method for analyzing inference and learning in parametric models, considering situations where the data-generating distribution is unknown or analytically intractable. Instead of assuming idealized distributions to carry out quenched averages analytically, we use a variational Gaussian approximation for the replicated system in grand canonical formalism in which the data average can be deferred and replaced by empirical averages, leading to stationarity conditions that adaptively determine the parameters of the trial Hamiltonian for each dataset. This approach clarifies how fluctuations affect information extraction and connects directly with the results of mathematical statistics or learning theory such as information criteria. As a concrete application, we analyze linear regression and derive learning curves. This includes cases with real-world datasets, where exact replica calculations are not feasible.
