Selective classification using a robust meta-learning approach
Nishant Jain, Karthikeyan Shanmugam, Pradeep Shenoy
TL;DR
Predictive uncertainty is essential for robust performance and selective classification. The authors propose ReVaR, a unified framework that learns an instance-conditioned uncertainty score (U-Score) via a bilevel optimization, jointly training a classifier and a meta-uncertainty model to minimize dropout-variance surrogate uncertainty. A variance-minimizing meta-regularizer drives the U-Score to capture diverse uncertainty sources, enabling effective test-time uncertainty measurement and training-time reweighting. Empirically, ReVaR achieves state-of-the-art selective classification and competitive calibration across diverse datasets and domain shifts, including large pretrained models like PLEX, highlighting its practical impact for robust AI systems.
Abstract
Predictive uncertainty-a model's self awareness regarding its accuracy on an input-is key for both building robust models via training interventions and for test-time applications such as selective classification. We propose a novel instance-conditioned reweighting approach that captures predictive uncertainty using an auxiliary network and unifies these train- and test-time applications. The auxiliary network is trained using a meta-objective in a bilevel optimization framework. A key contribution of our proposal is the meta-objective of minimizing the dropout variance, an approximation of Bayesian Predictive uncertainty. We show in controlled experiments that we effectively capture the diverse specific notions of uncertainty through this meta-objective, while previous approaches only capture certain aspects. These results translate to significant gains in real-world settings-selective classification, label noise, domain adaptation, calibration-and across datasets-Imagenet, Cifar100, diabetic retinopathy, Camelyon, WILDs, Imagenet-C,-A,-R, Clothing1M, etc. For Diabetic Retinopathy, we see upto 3.4%/3.3% accuracy and AUC gains over SOTA in selective classification. We also improve upon large-scale pretrained models such as PLEX.
