Label-wise Aleatoric and Epistemic Uncertainty Quantification

Yusuf Sale; Paul Hofman; Timo Löhr; Lisa Wimmer; Thomas Nagler; Eyke Hüllermeier

Label-wise Aleatoric and Epistemic Uncertainty Quantification

Yusuf Sale, Paul Hofman, Timo Löhr, Lisa Wimmer, Thomas Nagler, Eyke Hüllermeier

TL;DR

The paper addresses the challenge of quantifying predictive uncertainty in multiclass classification by introducing a label-wise decomposition based on second-order distributions $Q \in \Delta_K^{(2)}$, separating total, aleatoric, and epistemic uncertainty at the class level. It develops both entropy-based and variance-based per-label formulations, with a loss-based framework that can instantiate $\phi$ as log-loss (yielding entropy) or squared-error (yielding variance), and aggregates these to global measures. The authors establish a comprehensive axiomatic foundation for second-order uncertainty (A0–A7, plus A6–A7) and demonstrate that both entropy- and variance-based measures satisfy many of these properties, with variance-based measures offering advantages such as addressing A5 and providing a natural fit for binary label-wise decisions. Empirically, the approach is validated on medical imaging data (PET/CT) and standard benchmarks, showing meaningful per-class uncertainty insights and competitive performance in accuracy-rejection and OoD detection tasks, while maintaining coherence with global uncertainty assessments. The work thus enables cost-sensitive decision-making, improved interpretability, and robust uncertainty quantification in safety-critical applications, with code available for reproducibility.

Abstract

We present a novel approach to uncertainty quantification in classification tasks based on label-wise decomposition of uncertainty measures. This label-wise perspective allows uncertainty to be quantified at the individual class level, thereby improving cost-sensitive decision-making and helping understand the sources of uncertainty. Furthermore, it allows to define total, aleatoric, and epistemic uncertainty on the basis of non-categorical measures such as variance, going beyond common entropy-based measures. In particular, variance-based measures address some of the limitations associated with established methods that have recently been discussed in the literature. We show that our proposed measures adhere to a number of desirable properties. Through empirical evaluation on a variety of benchmark data sets -- including applications in the medical domain where accurate uncertainty quantification is crucial -- we establish the effectiveness of label-wise uncertainty quantification.

Label-wise Aleatoric and Epistemic Uncertainty Quantification

TL;DR

The paper addresses the challenge of quantifying predictive uncertainty in multiclass classification by introducing a label-wise decomposition based on second-order distributions

, separating total, aleatoric, and epistemic uncertainty at the class level. It develops both entropy-based and variance-based per-label formulations, with a loss-based framework that can instantiate

as log-loss (yielding entropy) or squared-error (yielding variance), and aggregates these to global measures. The authors establish a comprehensive axiomatic foundation for second-order uncertainty (A0–A7, plus A6–A7) and demonstrate that both entropy- and variance-based measures satisfy many of these properties, with variance-based measures offering advantages such as addressing A5 and providing a natural fit for binary label-wise decisions. Empirically, the approach is validated on medical imaging data (PET/CT) and standard benchmarks, showing meaningful per-class uncertainty insights and competitive performance in accuracy-rejection and OoD detection tasks, while maintaining coherence with global uncertainty assessments. The work thus enables cost-sensitive decision-making, improved interpretability, and robust uncertainty quantification in safety-critical applications, with code available for reproducibility.

Abstract

Paper Structure (24 sections, 3 theorems, 18 equations, 7 figures, 2 tables)

This paper contains 24 sections, 3 theorems, 18 equations, 7 figures, 2 tables.

Introduction
Quantifying Second-Order Uncertainty
Entropy-Based Measures
Desirable Properties
Label-wise Uncertainty Quantification
Entropy-based Measures
Variance-based Measures
Experiments
Label-wise Uncertainties
Accuracy-Rejection Curves
Out-of-Distribution Detection
Concluding Remarks
Proofs
Experimental Details
Experiments on ML Data sets
...and 9 more sections

Key Result

Theorem 3.1

Entropy-based measures tu:label_entropy, au:label_entropy, and eu:label_entropy satisfy Axioms A0, A1, A2 (only for $\text{TU}$), A3 (strict version), A4 (strict version, only for $\text{TU}$), A6, and A7.

Figures (7)

Figure 1: Label-wise aleatoric and epistemic uncertainties for MNIST instances.
Figure 2: Coronal 2D image from a patient with malignant lesions. Left: CT, middle: PET with segmentation by medical experts, right: corresponding aleatoric and epistemic uncertainties, ground truth class and predicted class.
Figure 3: Accuracy-rejection curves generated on the CIFAR10 data set. We compare entropy ($\text{TU}_{\text{ent}}$), label-wise constructed entropy ($\text{TU}_{\text{lent}}$), and the variance-based ($\text{TU}_{\text{var}}$) measure of total uncertainty.
Figure 4: Medical image examples with highest total (top), aleatoric(middle) and epistemic uncertainties (bottom) along with their corresponding label-wise uncertainties, ground truth class and predicted class.
Figure 5: MNIST instances with highest total (top), aleatoric (middle), and epistemic uncertainties (bottom) along with their corresponding label-wise uncertainties.
...and 2 more figures

Theorems & Definitions (7)

Definition 2.1
Theorem 3.1
Theorem 3.2
Proposition A.1
proof
proof : Proof of Theorem \ref{['thm:entropy_axioms']}
proof : Proof of Theorem \ref{['thm:axioms']}

Label-wise Aleatoric and Epistemic Uncertainty Quantification

TL;DR

Abstract

Label-wise Aleatoric and Epistemic Uncertainty Quantification

Authors

TL;DR

Abstract

Table of Contents

Key Result

Figures (7)

Theorems & Definitions (7)