Epistemic Reject Option Prediction
Vojtech Franc, Jakub Paplham
TL;DR
This work addresses high-stakes prediction under finite data by formulating an epistemic reject-option predictor, which abstains when epistemic uncertainty (the uncertainty arising from limited training data) makes a reliable decision unlikely. It shifts the learning objective from risk minimization to regret minimization: it defines a regret-based reject loss and proves that the Bayesian expected regret $R_B^\delta(Q)$ is minimized by a predictor that thresholds the epistemic uncertainty $E(x,D)$. The framework unifies and theoretically justifies two commonly used uncertainty measures, entropy and variance, as instances of conditional regret under specific losses, and it contrasts the epistemic predictor with traditional Bayesian rejection, which relies on total uncertainty $T(x,D)$. On a synthetic regression task, the epistemic predictor consistently achieves lower regret (AuReC) across varying training-set sizes, which highlights its practical potential for selective prediction in data-scarce regimes. The framework thus provides a principled, data-aware mechanism for identifying inputs where the training data is insufficient for reliable prediction, with implications for safe, deployable AI systems.
Abstract
In high-stakes applications, predictive models must not only produce accurate predictions but also quantify and communicate their uncertainty. Reject-option prediction addresses this by allowing the model to abstain when prediction uncertainty is high. Traditional reject-option approaches focus solely on aleatoric uncertainty, an assumption valid only when large training data makes the epistemic uncertainty negligible. However, in many practical scenarios, limited data makes this assumption unrealistic. This paper introduces the epistemic reject-option predictor, which abstains in regions of high epistemic uncertainty caused by insufficient data. Building on Bayesian learning, we redefine the optimal predictor as the one that minimizes expected regret -- the performance gap between the learned model and the Bayes-optimal predictor with full knowledge of the data distribution. The model abstains when the regret for a given input exceeds a specified rejection cost. To our knowledge, this is the first principled framework that enables learning predictors capable of identifying inputs for which the training data is insufficient to make reliable decisions.
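The rejection rule described above can be illustrated with a minimal sketch. It assumes a setting chosen purely for illustration and not taken from the paper's experiments: squared loss and a conjugate Bayesian linear regression with known noise variance. In that setting, a standard identity says that the expected regret of the posterior-mean prediction at an input equals the posterior variance of the regression function at that input, so the sketch uses this variance as the epistemic uncertainty and rejects when it exceeds a rejection cost. The names `fit_posterior`, `epistemic_reject_predict`, `features`, `noise_var`, `prior_var`, and `delta` are hypothetical.

```python
import numpy as np


def fit_posterior(X, y, noise_var=0.25, prior_var=10.0):
    """Gaussian posterior over weights w for y = X @ w + noise (illustrative model)."""
    d = X.shape[1]
    precision = X.T @ X / noise_var + np.eye(d) / prior_var
    cov = np.linalg.inv(precision)
    mean = cov @ X.T @ y / noise_var
    return mean, cov


def epistemic_reject_predict(X_test, mean, cov, noise_var, delta):
    """Predict with the posterior mean; reject where epistemic variance exceeds delta."""
    pred = X_test @ mean
    # Posterior variance of the regression function x^T w at each test input.
    # Under squared loss this equals the expected regret of the posterior-mean prediction.
    epistemic = np.einsum("ij,jk,ik->i", X_test, cov, X_test)
    # Total predictive variance adds the irreducible (aleatoric) noise variance.
    total = epistemic + noise_var
    reject = epistemic > delta
    return pred, epistemic, total, reject


def features(x):
    """Hypothetical feature map: bias term plus the raw input."""
    return np.stack([np.ones_like(x), x], axis=1)


# Small training set concentrated in [-1, 1] (synthetic, for illustration only).
rng = np.random.default_rng(0)
x_train = rng.uniform(-1.0, 1.0, size=20)
y_train = 1.0 + 2.0 * x_train + rng.normal(0.0, 0.5, size=20)

mean, cov = fit_posterior(features(x_train), y_train)

# One input inside the training range and one far outside it.
x_test = np.array([0.0, 5.0])
pred, epistemic, total, reject = epistemic_reject_predict(
    features(x_test), mean, cov, noise_var=0.25, delta=0.1
)
print("epistemic:", epistemic)
print("total:    ", total)
print("reject:   ", reject)
```

In this sketch the noise variance is constant, so thresholding total variance would rank inputs identically to thresholding epistemic variance. The two rules differ only when the aleatoric part varies with the input, which is the case where separating them matters.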
