Generalization of Gibbs and Langevin Monte Carlo Algorithms in the Interpolation Regime
Andreas Maurer, Erfan Mirzaei, Massimiliano Pontil
TL;DR
The paper tackles why overparameterized models generalize despite interpolating training data by developing data-dependent, high-probability bounds for the test error of the Gibbs posterior and posterior mean across all temperatures, using an integral representation of the log-partition function and PAC-Bayesian ideas. It demonstrates that these bounds are stable under Langevin Monte Carlo approximations and validates the approach on MNIST and CIFAR-10, showing nontrivial bounds for true labels and correct upper bounds for random labels. Through calibration and ergodic-mean approximations, the method yields tight, temperature-aware guarantees in the interpolation regime, linking high-temperature training behavior to low-temperature generalization signals. The work provides a practical framework for certifying LMC-based learners and offers insight into how generalization can emerge even when training errors are small on data designed to fail test performance.
Abstract
The paper provides data-dependent bounds on the test error of the Gibbs algorithm in the overparameterized interpolation regime, where low training errors are also obtained for impossible data, such as random labels in classification. The bounds are stable under approximation with Langevin Monte Carlo algorithms. Experiments on the MNIST and CIFAR-10 datasets verify that the bounds yield nontrivial predictions on true labeled data and correctly upper bound the test error for random labels. Our method indicates that generalization in the low-temperature, interpolation regime is already signaled by small training errors in the more classical high temperature regime.
