Table of Contents
Fetching ...

Uncertainty in Machine Learning

Hans Weytjens, Wouter Verbeke

TL;DR

Uncertainty in Machine Learning surveys the distinction between epistemic and aleatoric uncertainty and presents practical methods to quantify and manage uncertainty across linear regression, random forests, and neural networks. It introduces conformal prediction as a model-agnostic framework for producing prediction sets and intervals with finite-sample guarantees, and demonstrates how to use entropy-based measures and accuracy-rejection curves to calibrate and compare uncertainty estimates. The chapter also discusses Bayesian neural networks and dropout-based approximations as scalable means to capture both epistemic and aleatoric uncertainty in regression and classification, with an explicit loss formulation for aleatoric noise. The practical impact is to enable risk-aware decision making, automated triage, and earlier deployment of prediction systems by leveraging quantified uncertainty and calibration tools.

Abstract

This book chapter introduces the principles and practical applications of uncertainty quantification in machine learning. It explains how to identify and distinguish between different types of uncertainty and presents methods for quantifying uncertainty in predictive models, including linear regression, random forests, and neural networks. The chapter also covers conformal prediction as a framework for generating predictions with predefined confidence intervals. Finally, it explores how uncertainty estimation can be leveraged to improve business decision-making, enhance model reliability, and support risk-aware strategies.

Uncertainty in Machine Learning

TL;DR

Uncertainty in Machine Learning surveys the distinction between epistemic and aleatoric uncertainty and presents practical methods to quantify and manage uncertainty across linear regression, random forests, and neural networks. It introduces conformal prediction as a model-agnostic framework for producing prediction sets and intervals with finite-sample guarantees, and demonstrates how to use entropy-based measures and accuracy-rejection curves to calibrate and compare uncertainty estimates. The chapter also discusses Bayesian neural networks and dropout-based approximations as scalable means to capture both epistemic and aleatoric uncertainty in regression and classification, with an explicit loss formulation for aleatoric noise. The practical impact is to enable risk-aware decision making, automated triage, and earlier deployment of prediction systems by leveraging quantified uncertainty and calibration tools.

Abstract

This book chapter introduces the principles and practical applications of uncertainty quantification in machine learning. It explains how to identify and distinguish between different types of uncertainty and presents methods for quantifying uncertainty in predictive models, including linear regression, random forests, and neural networks. The chapter also covers conformal prediction as a framework for generating predictions with predefined confidence intervals. Finally, it explores how uncertainty estimation can be leveraged to improve business decision-making, enhance model reliability, and support risk-aware strategies.

Paper Structure

This paper contains 30 sections, 19 equations, 14 figures, 8 tables.

Figures (14)

  • Figure 1: Examples of epistemic and aleatoric uncertainty for a regression task.
  • Figure 2: Examples of epistemic and aleatoric uncertainty for a classification task. Image credit: hullermeier2021aleatoric
  • Figure 3: Confidence intervals: base case and lower confidence level
  • Figure 4: Confidence intervals: adding epistemic and aleatoric uncertainty
  • Figure 5: : Limitations of confidence intervals: heteroscedasticity and extrapolation.
  • ...and 9 more figures