Cross-Validation Conformal Risk Control
Kfir M. Cohen, Sangwoo Park, Osvaldo Simeone, Shlomo Shamai
TL;DR
The paper addresses uncertainty quantification for predictive sets under risk constraints by extending CRC to a cross-validation framework. It introduces CV-CRC, which uses K-fold cross-validation and a minimum nonconformity score across folds to form predictive sets, with a threshold chosen to bound the average risk by $\alpha$; theoretical guarantees require bounded, monotone losses and a permutation-invariant nonconformity score. The method generalizes CV-CP to arbitrary bounded risks and shows reduced average set sizes compared to VB-CRC when data are scarce, demonstrated on vector regression and temporal point process tasks. This cross-validated approach enables more data-efficient, calibration-guaranteed uncertainty quantification with potential extensions via jackknife+ and meta-learning.
Abstract
Conformal risk control (CRC) is a recently proposed technique that applies post-hoc to a conventional point predictor to provide calibration guarantees. Generalizing conformal prediction (CP), with CRC, calibration is ensured for a set predictor that is extracted from the point predictor to control a risk function such as the probability of miscoverage or the false negative rate. The original CRC requires the available data set to be split between training and validation data sets. This can be problematic when data availability is limited, resulting in inefficient set predictors. In this paper, a novel CRC method is introduced that is based on cross-validation, rather than on validation as the original CRC. The proposed cross-validation CRC (CV-CRC) extends a version of the jackknife-minmax from CP to CRC, allowing for the control of a broader range of risk functions. CV-CRC is proved to offer theoretical guarantees on the average risk of the set predictor. Furthermore, numerical experiments show that CV-CRC can reduce the average set size with respect to CRC when the available data are limited.
