Quantifying Deep Learning Model Uncertainty in Conformal Prediction
Hamed Karimi, Reza Samavi
TL;DR
The paper addresses reliable predictive uncertainty in high-stakes settings like medical AI by leveraging Conformal Prediction (CP), which provides prediction sets $\mathcal{C}(x,\widehat{q})$ with marginal coverage guarantees $P(y \in \mathcal{C}) \ge 1-\delta$ via calibration data and a score $s(x,y)$. It introduces a probabilistic uncertainty-quantification framework for CP, defining conformal model uncertainty $U_{\mathcal{C}}(x_{val})$ with bounds $\mathcal{L}_{\mathcal{C}}$ and $\mathcal{H}_{\mathcal{C}}$ based on the prediction-set size $m$, number of classes $K$, calibration size $n$, and coverage error $\delta$, and a pure-model uncertainty $\widehat{u}_{\mathcal{C}} = \frac{m+\delta-1}{K}$. A special case of $m=0$ yields maximal uncertainty, while larger $m$ and larger $\delta$ increase the bound width and hence the quantified uncertainty, captured by $d_{\mathcal{C}} = \frac{1+\widehat{u}_{\mathcal{C}}}{n+1}$. The approach enables comparisons with Bayesian and Evidential UQ methods and provides a principled, certified framework for uncertainty in CP-based classification, with practical implications for medical decision-making; it also discusses related CP variants APS and RAPS that improve adaptivity and reduce set size while maintaining coverage.
Abstract
Precise estimation of predictive uncertainty in deep neural networks is a critical requirement for reliable decision-making in machine learning and statistical modeling, particularly in the context of medical AI. Conformal Prediction (CP) has emerged as a promising framework for representing the model uncertainty by providing well-calibrated confidence levels for individual predictions. However, the quantification of model uncertainty in conformal prediction remains an active research area, yet to be fully addressed. In this paper, we explore state-of-the-art CP methodologies and their theoretical foundations. We propose a probabilistic approach in quantifying the model uncertainty derived from the produced prediction sets in conformal prediction and provide certified boundaries for the computed uncertainty. By doing so, we allow model uncertainty measured by CP to be compared by other uncertainty quantification methods such as Bayesian (e.g., MC-Dropout and DeepEnsemble) and Evidential approaches.
