Online Selective Conformal Prediction: Errors and Solutions
Yusuf Sale, Aaditya Ramdas
TL;DR
The paper addresses uncertainty quantification under online selective reporting, where only selected observations receive prediction intervals. It reveals that many calibration strategies fail to preserve exchangeability with the selected test datum, undermining selection-conditional coverage and FCR control. To fix this, it introduces exchangeability-preserving calibration strategies—EXPRESS, K_EXPRESS, and EXPRESS_M—along with a merging approach, proving strong selection-conditional coverage and provable FCR control under these schemes. Empirical results illustrate the trade-offs between calibration-set size and interval informativeness, and compare these methods to conformal LORD-CI and adaptive conformal inference, highlighting practical guidance for robust online selective conformal prediction.
Abstract
In online selective conformal inference, data arrives sequentially, and prediction intervals are constructed only when an online selection rule is met. Since online selections may break the exchangeability between the selected test datum and the rest of the data, one must correct for this by suitably selecting the calibration data. In this paper, we evaluate existing calibration selection strategies and pinpoint some fundamental errors in the associated claims that guarantee selection-conditional coverage and control of the false coverage rate (FCR). To address these shortcomings, we propose novel calibration selection strategies that provably preserve the exchangeability of the calibration data and the selected test datum. Consequently, we demonstrate that online selective conformal inference with these strategies guarantees both selection-conditional coverage and FCR control. Our theoretical findings are supported by experimental evidence examining tradeoffs between valid methods.
