Training-Conditional Coverage Bounds under Covariate Shift

Mehrdad Pournaderi; Yu Xiang

Training-Conditional Coverage Bounds under Covariate Shift

Mehrdad Pournaderi, Yu Xiang

TL;DR

This work derives PAC-style bounds for the training-conditional miscoverage of conformal prediction methods under covariate shift, addressing a gap where prior results emphasized marginal coverage. By introducing likelihood-ratio weighting and weighted DKW inequalities, the authors obtain explicit tail guarantees for split conformal, and analogous bounds for full conformal and jackknife+ under uniform stability and bi-Lipschitz assumptions. The bounds transparently depend on the shift severity via an upper bound $B$ on the likelihood ratio $dQ/dP$ and on calibration or sample sizes, providing a practical sense of how much data is needed to maintain training-conditional reliability under shift. The results illuminate the trade-offs and robustness considerations when using weighted conformal methods in shifted environments and suggest avenues for estimating likelihood ratios and extending the framework to robustness and SSL settings.

Abstract

Conformal prediction methodology has recently been extended to the covariate shift setting, where the distribution of covariates differs between training and test data. While existing results ensure that the prediction sets from these methods achieve marginal coverage above a nominal level, their coverage rate conditional on the training dataset (referred to as training-conditional coverage) remains unexplored. In this paper, we address this gap by deriving upper bounds on the tail of the training-conditional coverage distribution, offering probably approximately correct (PAC) guarantees for these methods. Our results characterize the reliability of the prediction sets in terms of the severity of distributional changes and the size of the training dataset.

Training-Conditional Coverage Bounds under Covariate Shift

TL;DR

on the likelihood ratio

and on calibration or sample sizes, providing a practical sense of how much data is needed to maintain training-conditional reliability under shift. The results illuminate the trade-offs and robustness considerations when using weighted conformal methods in shifted environments and suggest avenues for estimating likelihood ratios and extending the framework to robustness and SSL settings.

Abstract

Paper Structure (22 sections, 19 theorems, 142 equations, 2 figures)

This paper contains 22 sections, 19 theorems, 142 equations, 2 figures.

Introduction
Background and Related Work
Full conformal and split conformal
Jackknife+
Conformal prediction under distribution shift
Conditional Coverage Guarantees
Split conformal
Full conformal and jackknife+
Discussion
Conclusion
Technical Lemmas: Weighted DKW Inequalities
Background
Weighted DKW Inequalities
Proof of Theorem \ref{['thm:splt']}
Proof of Remark \ref{['splt_alt']}
...and 7 more sections

Key Result

Theorem 1

Let $m<n$ denote the size of the calibration data set. Assume that $Q$ is absolutely continuous with respect to $P$ ($Q \ll P$) and $dQ/dP\leq B<\infty$. Then, for all $\delta > 0$, where $C>0$ is a universal constant. The probability is taken with respect to $P$ since each entry in ${\bf D}_n$ follows $P$.

Figures (2)

Figure 1: Histograms of the training-conditional coverage rates are presented for four datasets from the UCI Machine Learning Repository: Wine Quality (top left), Abalone (top right), Concrete Compressive Strength (bottom left), and Combined Cycle Power Plant (bottom right). See Appendix \ref{['sim_details']} for details of this simulation study.
Figure 2: The likelihood ratio is estimated using kernel density estimator with Gaussian kernel.

Theorems & Definitions (20)

Theorem 1
Corollary 1
Theorem 2
Remark 1
Theorem 3: Jackknife+ under exchangeability
Theorem 4: Jackknife+ under covariate shift
Corollary 2: CV+
Theorem 5: Full conformal under exchangeability
Theorem 6: Full conformal under covariate shift
Theorem 7: van1997weak(Theorem 2.14.2)
...and 10 more

Training-Conditional Coverage Bounds under Covariate Shift

TL;DR

Abstract

Training-Conditional Coverage Bounds under Covariate Shift

Authors

TL;DR

Abstract

Table of Contents

Key Result

Figures (2)

Theorems & Definitions (20)