Time-uniform conformal and PAC prediction
Kayla E. Scharfstein, Arun Kumar Kuchibhotla
TL;DR
This work addresses uncertainty quantification for sequential, streaming data by extending distribution-free conformal and PAC prediction to time-uniform guarantees at arbitrary stopping times. It develops split-time-uniform and online time-uniform prediction frameworks (TUC and TUPAC) that yield anytime-valid coverage around a test point, using fixed or dynamically updated transformations and carefully calibrated quantiles. The paper proves equivalence and oracle-width results, demonstrates convergence of prediction-set width to the oracle in suitable settings, and validates the methods with simulations and a real spam-detection dataset, including adaptivity to distribution shifts. These contributions enable memory-efficient, data-driven uncertainty quantification for streaming decision systems, with practical guidance on implementation and extensions to non-IID settings. Collectively, the work provides a rigorous foundation for reliable, sequential conformal and PAC predictions under stopping-time uncertainty.
Abstract
Given that machine learning algorithms are increasingly being deployed to aid in high stakes decision-making, uncertainty quantification methods that wrap around these black box models such as conformal prediction have received much attention in recent years. In sequential settings, where data are observed/generated in a streaming fashion, traditional conformal methods do not provide any guarantee without fixing the sample size. More importantly, traditional conformal methods cannot cope with sequentially updated predictions. As such, we develop an extension of the conformal prediction and related probably approximately correct (PAC) prediction frameworks to sequential settings where the number of data points is not fixed in advance. The resulting prediction sets are anytime-valid in that their expected coverage is at the required level at any time chosen by the analyst even if this choice depends on the data. We present theoretical guarantees for our proposed methods and demonstrate their validity and utility on simulated and real datasets.
