On the Effect of Regularization on Nonparametric Mean-Variance Regression
Eliot Wong-Toi, Alex Boyd, Vincent Fortuin, Stephan Mandt
TL;DR
The paper investigates why overparameterized mean-variance regression exhibits sharp phase transitions as regularization changes, hindering reliable uncertainty quantification. It introduces a field-theoretic framework that casts the learning problem as a variational problem over mean and input-dependent precision fields, deriving coupled Euler–Lagrange equations that reveal how data fidelity and smoothing compete across input space. A Bayesian reformulation (BFT) places smoothness priors directly on the predictor fields, linking the deterministic FT to Gaussian-process–like priors and enabling ensemble-based uncertainty estimation. Experiments on synthetic data, UCI benchmarks, and ClimSim show consistent phase diagrams with stable, underfitting, and overfitting regimes, and demonstrate that a one-dimensional reparameterization of regularization along a diagonal reduces hyperparameter search while maintaining calibration performance. This work provides both theoretical insight into MVR instabilities and a practical tuning strategy that improves robust uncertainty quantification in large-scale, heterogeneous data contexts.
Abstract
Uncertainty quantification is vital for decision-making and risk assessment in machine learning. Mean-variance regression models, which predict both a mean and residual noise for each data point, provide a simple approach to uncertainty quantification. However, overparameterized mean-variance models struggle with signal-to-noise ambiguity, deciding whether prediction targets should be attributed to signal (mean) or noise (variance). At one extreme, models fit all training targets perfectly with zero residual noise, while at the other, they provide constant, uninformative predictions and explain the targets as noise. We observe a sharp phase transition between these extremes, driven by model regularization. Empirical studies with varying regularization levels illustrate this transition, revealing substantial variability across repeated runs. To explain this behavior, we develop a statistical field theory framework, which captures the observed phase transition in alignment with experimental results. This analysis reduces the regularization hyperparameter search space from two dimensions to one, significantly lowering computational costs. Experiments on UCI datasets and the large-scale ClimSim dataset demonstrate robust calibration performance, effectively quantifying predictive uncertainty.
