Calibrated Selective Classification
Adam Fisch, Tommi Jaakkola, Regina Barzilay
TL;DR
Calibrated Selective Classification tackles the challenge of reliable uncertainty by pairing a fixed predictor with a trainable selector to abstain on inputs whose uncertainties are not well calibrated. It introduces selective calibration and the S-MMCE objective, plus a practical upper bound, and uses a DRO-inspired regime with synthetic domain shifts to improve out-of-domain calibration while enforcing a coverage constraint. The framework is validated on image-corruption benchmarks (CIFAR-10-C, ImageNet-C) and a lung cancer risk task, consistently reducing selective calibration error compared to baselines and showing meaningful robustness to distribution shifts. The work demonstrates that calibrated abstention can yield more trustworthy predictions without requiring full retraining of the base model, with strong implications for high-stakes decision making and medical applications.
Abstract
Selective classification allows models to abstain from making predictions (e.g., say "I don't know") when in doubt in order to obtain better effective accuracy. While typical selective models can be effective at producing more accurate predictions on average, they may still allow for wrong predictions that have high confidence, or skip correct predictions that have low confidence. Providing calibrated uncertainty estimates alongside predictions -- probabilities that correspond to true frequencies -- can be as important as having predictions that are simply accurate on average. However, uncertainty estimates can be unreliable for certain inputs. In this paper, we develop a new approach to selective classification in which we propose a method for rejecting examples with "uncertain" uncertainties. By doing so, we aim to make predictions with {well-calibrated} uncertainty estimates over the distribution of accepted examples, a property we call selective calibration. We present a framework for learning selectively calibrated models, where a separate selector network is trained to improve the selective calibration error of a given base model. In particular, our work focuses on achieving robust calibration, where the model is intentionally designed to be tested on out-of-domain data. We achieve this through a training strategy inspired by distributionally robust optimization, in which we apply simulated input perturbations to the known, in-domain training data. We demonstrate the empirical effectiveness of our approach on multiple image classification and lung cancer risk assessment tasks.
