Do you understand epistemic uncertainty? Think again! Rigorous frequentist epistemic uncertainty estimation in regression
Enrico Foglia, Benjamin Bobbia, Nikita Durasov, Michael Bauerheim, Pascal Fua, Stephane Moreau, Thierry Jardin
TL;DR
This paper tackles the challenge of distinguishing epistemic from aleatoric uncertainty in regression by introducing a frequentist framework that trains a model to output two conditional predictions per input, where the second prediction conditions on the first. By adopting calibration principles and triplet data $(x,y_1,y_2)$, the authors show that the model covariance ${\rm cov}_\theta(x) = \mathbb{V}[\mathbb{E}[Y|X]\;|\;[x]]$ captures the epistemic component, while the total predictive variance decomposes as $\mathbb{V}_\theta[Y|x] = \mathbb{E}[\mathbb{V}[Y|X]\;|\,[x]] + \mathbb{V}[\mathbb{E}[Y|X]\;|\,[x]]$. They provide a practical training objective using negative log-likelihood for the joint predictor $p_\theta(y_1,y_2|x)$ and show how to estimate covariance via Monte Carlo sampling, all with minimal changes to standard architectures. Validation on synthetic data and real experiments (airfoil aerodynamics and drone noise) demonstrates that ${\rm cov}_\theta(x)$ tracks epistemic uncertainty and grows outside the training domain, supporting a robust, repeatable assessment of prediction reliability in regression settings. The work offers a distinct, non-Bayesian avenue for uncertainty quantification in regression tasks, particularly valuable when repeatable measurements exist and calibration can be achieved.
Abstract
Quantifying model uncertainty is critical for understanding prediction reliability, yet distinguishing between aleatoric and epistemic uncertainty remains challenging. We extend recent work from classification to regression to provide a novel frequentist approach to epistemic and aleatoric uncertainty estimation. We train models to generate conditional predictions by feeding their initial output back as an additional input. This method allows for a rigorous measurement of model uncertainty by observing how prediction responses change when conditioned on the model's previous answer. We provide a complete theoretical framework to analyze epistemic uncertainty in regression in a frequentist way, and explain how it can be exploited in practice to gauge a model's uncertainty, with minimal changes to the original architecture.
