From Risk to Uncertainty: Generating Predictive Uncertainty Measures via Bayesian Estimation
Nikita Kotelevskii, Vladimir Kondratyev, Martin Takáč, Éric Moulines, Maxim Panov
TL;DR
This paper presents a unified risk-based framework to quantify predictive uncertainty by decomposing pointwise risk into aleatoric and epistemic components using strictly proper scoring rules and Bayesian estimation. By expressing $R_{Tot}$ as $R_{Bayes}+R_{Exc}$ and leveraging Bayesian predictions, the authors show how well-known uncertainty measures (e.g., Mutual Information, EPKL) arise as special cases under different approximations, and they connect the framework to energy-based models. Through extensive experiments on CIFAR10/100 and TinyImageNet, they show that Log-score-based measures are generally effective for OOD detection, while Bayes and Total risks tend to excel at misclassification detection, with Excess risk offering advantages in soft-OOD scenarios. The work provides practical guidance on selecting uncertainty measures based on task (OOD vs misclassification) and data regime (soft- vs hard-OOD), and it establishes a theoretical link between diverse uncertainty metrics within a single Bayesian risk framework.
Abstract
There are various measures of predictive uncertainty in the literature, but their relationships to each other remain unclear. This paper uses a decomposition of statistical pointwise risk into components, associated with different sources of predictive uncertainty, namely aleatoric uncertainty (inherent data variability) and epistemic uncertainty (model-related uncertainty). Together with Bayesian methods, applied as an approximation, we build a framework that allows one to generate different predictive uncertainty measures. We validate our method on image datasets by evaluating its performance in detecting out-of-distribution and misclassified instances using the AUROC metric. The experimental results confirm that the measures derived from our framework are useful for the considered downstream tasks.
