Second-Order Uncertainty Quantification: Variance-Based Measures
Yusuf Sale, Paul Hofman, Lisa Wimmer, Eyke Hüllermeier, Thomas Nagler
TL;DR
This work addresses quantifying second-order uncertainty in probabilistic classification by introducing variance-based measures that decompose uncertainty into label-wise aleatoric ($\mathrm{AU}_k$) and epistemic ($\mathrm{EU}_k$) components using the law of total variance. By defining $\Theta_k=P(Y_k=1)$ and modeling $\Theta=(\Theta_1,...,\Theta_K)$ with a second-order distribution $Q\in\Delta_K^{(2)}$, the authors derive total, aleatoric, and epistemic uncertainties as $\mathrm{TU}_k=\mathrm{Var}(Y_k)$, $\mathrm{AU}_k=\mathbb{E}[\Theta_k(1-\Theta_k)]$, and $\mathrm{EU}_k=\mathrm{Var}(\Theta_k)$, with global versions using weights $w_k$. They argue and prove that these variance-based measures satisfy a set of axioms (A0–A7) ensuring non-negativity, correct limiting behavior, and sensible aggregation across partitions and mean-preserving spreads. Empirically, the variance-based measures are competitive with entropy-based baselines on accuracy-rejection curves, correct/incorrect prediction analyses, and out-of-distribution detection, while offering finer-grained label-wise uncertainty insights beneficial for risk-aware decision making. Overall, the approach provides a principled, interpretable alternative to entropy for second-order uncertainty in classification, with demonstrated practical utility in safety-critical contexts and clear directions for future work on loss-based uncertainty decompositions.
Abstract
Uncertainty quantification is a critical aspect of machine learning models, providing important insights into the reliability of predictions and aiding the decision-making process in real-world applications. This paper proposes a novel way to use variance-based measures to quantify uncertainty on the basis of second-order distributions in classification problems. A distinctive feature of the measures is the ability to reason about uncertainties on a class-based level, which is useful in situations where nuanced decision-making is required. Recalling some properties from the literature, we highlight that the variance-based measures satisfy important (axiomatic) properties. In addition to this axiomatic approach, we present empirical results showing the measures to be effective and competitive to commonly used entropy-based measures.
