Non-exchangeable Conformal Prediction with Optimal Transport: Tackling Distribution Shifts with Unlabeled Data
Alvaro H. C. Correia, Christos Louizos
TL;DR
This work tackles the breakdown of conformal prediction guarantees under distribution shifts by introducing total coverage gap as a unifying metric and deriving OT-based bounds that relate miscoverage to score-distribution differences. It develops two main bounds: a density-CDF bound and a 1-Wasserstein bound, plus a novel unlabeled-data bound that relies on auxiliary score distributions derived from unlabeled test data, enabling label-free calibration. The authors then learn calibration weights by optimizing these bounds, using labeled calibration data and unlabeled test data, and extend the framework to regression. Empirically, the method yields substantial reductions in coverage gap on synthetic regression and real-world datasets such as ImageNet-C and iWildCam, often outperforming likelihood-ratio, entropy-scaled, and conditional-conformal baselines while preserving interpretation of prediction sets. Overall, the approach provides a general, practical framework for robust uncertainty quantification under shifts with minimal labeled data from the test distribution.
Abstract
Conformal prediction is a distribution-free uncertainty quantification method that has gained popularity in the machine learning community due to its finite-sample guarantees and ease of use. Its most common variant, dubbed split conformal prediction, is also computationally efficient as it boils down to collecting statistics of the model predictions on some calibration data not yet seen by the model. Nonetheless, these guarantees only hold if the calibration and test data are exchangeable, a condition that is difficult to verify and often violated in practice due to so-called distribution shifts. The literature is rife with methods to mitigate the loss in coverage in this non-exchangeable setting, but these methods require some prior information on the type of distribution shift to be expected at test time. In this work, we study this problem via a new perspective, through the lens of optimal transport, and show that it is possible to estimate the loss in coverage and mitigate arbitrary distribution shifts, offering a principled and broadly applicable solution.
