Training-Conditional Coverage Bounds under Covariate Shift
Mehrdad Pournaderi, Yu Xiang
TL;DR
This work derives PAC-style bounds for the training-conditional miscoverage of conformal prediction methods under covariate shift, addressing a gap where prior results emphasized marginal coverage. By introducing likelihood-ratio weighting and weighted DKW inequalities, the authors obtain explicit tail guarantees for split conformal, and analogous bounds for full conformal and jackknife+ under uniform stability and bi-Lipschitz assumptions. The bounds transparently depend on the shift severity via an upper bound $B$ on the likelihood ratio $dQ/dP$ and on calibration or sample sizes, providing a practical sense of how much data is needed to maintain training-conditional reliability under shift. The results illuminate the trade-offs and robustness considerations when using weighted conformal methods in shifted environments and suggest avenues for estimating likelihood ratios and extending the framework to robustness and SSL settings.
Abstract
Conformal prediction methodology has recently been extended to the covariate shift setting, where the distribution of covariates differs between training and test data. While existing results ensure that the prediction sets from these methods achieve marginal coverage above a nominal level, their coverage rate conditional on the training dataset (referred to as training-conditional coverage) remains unexplored. In this paper, we address this gap by deriving upper bounds on the tail of the training-conditional coverage distribution, offering probably approximately correct (PAC) guarantees for these methods. Our results characterize the reliability of the prediction sets in terms of the severity of distributional changes and the size of the training dataset.
