Machine Learning with a Reject Option: A survey
Kilian Hendrickx, Lorenzo Perini, Dries Van der Plas, Wannes Meert, Jesse Davis
TL;DR
This survey addresses the problem of machine learning with a reject option, focusing on how and when models should abstain to avoid costly mispredictions. It introduces three architectural families (separated, dependent, integrated) for operationalizing abstention, formalizes two rejection types (ambiguity and novelty), and reviews evaluation schemes (fixed-rate, trade-off curves, and cost-based). The paper also surveys learning strategies for each architecture, strategies for combining multiple rejectors, and highlights applications in biomedicine, engineering, economics, and image analysis, while linking rejection to uncertainty quantification, anomaly detection, active learning, open-world learning, delegation, and meta-learning. Finally, it outlines research questions and future directions, including standard benchmarks, partial rejection, and extending rejection techniques beyond classification. The practical impact is to provide a structured framework for building trustworthy, cost-aware systems that can safely defer uncertain decisions to human experts or alternative processes.
Abstract
Machine learning models always make a prediction, even when it is likely to be inaccurate. This behavior should be avoided in many decision support applications, where mistakes can have severe consequences. Albeit already studied in 1970, machine learning with rejection recently gained interest. This machine learning subfield enables machine learning models to abstain from making a prediction when likely to make a mistake. This survey aims to provide an overview on machine learning with rejection. We introduce the conditions leading to two types of rejection, ambiguity and novelty rejection, which we carefully formalize. Moreover, we review and categorize strategies to evaluate a model's predictive and rejective quality. Additionally, we define the existing architectures for models with rejection and describe the standard techniques for learning such models. Finally, we provide examples of relevant application domains and show how machine learning with rejection relates to other machine learning research areas.
