Asymptotics for conformal inference
Ulysse Gazin
TL;DR
The paper resolves the exact asymptotic distribution of conformal inference error processes when both calibration and test samples grow, revealing a Brownian-bridge limit after scaling by $\sqrt{\tau_{n,m}}$ with $\tau_{n,m}=nm/(n+m)$ and connecting the limit to the Kolmogorov distribution. It shows how distribution shift inflates variance and shifts the mean via the function $G$, and how weighted conformal p-values—especially the oracle weights—can restore nominal behavior in the limit. The authors extend the results to novelty detection, deriving full asymptotic distributions for FDP/TDP under BH, including weighted and non-oracle scenarios. These results provide practical, asymptotically-valid quantiles for conformal inference in large-sample regimes and quantify the impact of shifts and weighting on inference accuracy. The framework lays a foundation for future work on estimated weights and broader dependency structures in conformal procedures.
Abstract
Conformal inference is a versatile tool for building prediction sets in regression or classification. We study the false coverage proportion (FCP) in a transductive setting with a calibration sample of $n$ points and a test sample of $m$ points. We identify the exact, distribution-free, asymptotic distribution of the FCP when both $n$ and $m$ tend to infinity. This shows in particular that FCP control can be achieved by using the well-known Kolmogorov distribution, and puts forward that the asymptotic variance is decreasing in the ratio $n/m$. We then provide a number of extensions by considering the novelty detection problem, weighted conformal inference and distribution shift between the calibration sample and the test sample. In particular, our asymptotic results allow to accurately quantify the asymptotic behavior of the errors when weighted conformal inference is used.
