Rejection via Learning Density Ratios
Alexander Soen, Hisham Husain, Philip Schulz, Vu Nguyen
TL;DR
This work reframes classification with rejection as learning density ratios between an idealized, regularized distribution $\mathrm{Q}$ and the data distribution $\mathrm{P}$, using $\varphi$-divergence regularization to define the ratio $\rho = d\mathrm{Q}/d\mathrm{P}$. By deriving closed-form density-ratio rejectors under KL and $\alpha$-divergences, and by tying these to Generalized Variational Inference and Distributionally Robust Optimization, the authors recover classical rejection policies (Chow's rule) in the Bayes-optimal limit and enable practical, post-hoc rejection using calibrated posteriors. The approach is validated across six datasets with varying noise, showing competitive or superior accuracy-coverage trade-offs and providing insights into calibration and robustness in selective prediction. Overall, the framework offers a principled, distributional path from risk minimization with rejection to density-ratio based decision rules that can flexibly integrate with pretrained classifiers and robustness considerations in high-stakes settings.
Abstract
Classification with rejection emerges as a learning paradigm which allows models to abstain from making predictions. The predominant approach is to alter the supervised learning pipeline by augmenting typical loss functions, letting model rejection incur a lower loss than an incorrect prediction. Instead, we propose a different distributional perspective, where we seek to find an idealized data distribution which maximizes a pretrained model's performance. This can be formalized via the optimization of a loss's risk with a $\varphi$-divergence regularization term. Through this idealized distribution, a rejection decision can be made by utilizing the density ratio between this distribution and the data distribution. We focus on the setting where our $\varphi$-divergences are specified by the family of $α$-divergence. Our framework is tested empirically over clean and noisy datasets.
