Conformal Risk Minimization with Variance Reduction
Sima Noorani, Orlando Romero, Nicolo Dal Fabbro, Hamed Hassani, George J. Pappas
TL;DR
This paper addresses conformal risk minimization (CRM) by training models to produce efficient conformal prediction (CP) sets with guaranteed coverage. It analyzes ConfTr, reveals that its gradient estimator suffers from non-vanishing variance due to the population quantile gradient estimation, and proposes VR-ConfTr, which decouples tau(θ) estimation from gradient computation and uses a plug-in, variance-reduced estimator based on an epsilon-thresholded average of conformity-score gradients. The authors prove bias-variance trade-offs and provide practical guidelines (e.g., m-ranking) to tune the estimator, showing provable sample efficiency. Empirically, VR-ConfTr achieves faster convergence and consistently smaller CP prediction sets across multiple benchmarks (MNIST, Fashion-MNIST, KMNIST, OrganAMNIST, CIFAR-10) with comparable accuracy, highlighting its potential to improve CP-based uncertainty quantification in real-world tasks.
Abstract
Conformal prediction (CP) is a distribution-free framework for achieving probabilistic guarantees on black-box models. CP is generally applied to a model post-training. Recent research efforts, on the other hand, have focused on optimizing CP efficiency during training. We formalize this concept as the problem of conformal risk minimization (CRM). In this direction, conformal training (ConfTr) by Stutz et al.(2022) is a technique that seeks to minimize the expected prediction set size of a model by simulating CP in-between training updates. Despite its potential, we identify a strong source of sample inefficiency in ConfTr that leads to overly noisy estimated gradients, introducing training instability and limiting practical use. To address this challenge, we propose variance-reduced conformal training (VR-ConfTr), a CRM method that incorporates a variance reduction technique in the gradient estimation of the ConfTr objective function. Through extensive experiments on various benchmark datasets, we demonstrate that VR-ConfTr consistently achieves faster convergence and smaller prediction sets compared to baselines.
