Ask for More Than Bayes Optimal: A Theory of Indecisions for Classification
Mohamed Ndaoud, Peter Radchenko, Bradley Rava
TL;DR
This work develops a principled theory of selective classification that achieves user-specified accuracy by abstaining on indecisions, quantified through the minimax risk ${\mathcal{R}}(\gamma)$ and the optimal rule $Y_{\gamma}^*$. It connects selective classification to Neyman–Pearson testing, providing finite-sample calibration methods for both classification and hypothesis testing, with rigorous guarantees. A key theoretical contribution is a sharp phase transition in the Gaussian mixture setting, showing near-Bayes accuracy can be obtained with vanishing indecision mass, and the framework is extended to plug-in rules, MLR-adaptations, and multi-class problems. Empirical results on Gaussian mixtures and real data (COMPAS) confirm that even small indecision masses yield meaningful accuracy gains while maintaining risk control, making indecision a practical tool for high-stakes risk management.
Abstract
Selective classification is a powerful tool for automated decision-making in high-risk scenarios, allowing classifiers to act only when confident and abstain when uncertainty is high. Given a target accuracy, our goal is to minimize indecisions, observations we do not automate. For difficult problems, the target accuracy may be unattainable without abstention. By using indecisions, we can control the misclassification rate to any user-specified level, even below the Bayes optimal error rate, while minimizing overall indecision mass. We provide a complete characterization of the minimax risk in selective classification, establishing continuity and monotonicity properties that enable optimal indecision selection. We revisit selective inference via the Neyman-Pearson testing framework, where indecision enables control of type 2 error given fixed type 1 error probability. For both classification and testing, we propose a finite-sample calibration method with non-asymptotic guarantees, proving plug-in classifiers remain consistent and that accuracy-based calibration effectively controls indecision mass. In the binary Gaussian mixture model, we uncover the first sharp phase transition in selective inference, showing minimal indecision can yield near-optimal accuracy even under poor class separation. Experiments on Gaussian mixtures and real datasets confirm that small indecision proportions yield substantial accuracy gains, making indecision a principled tool for risk control.
