Conformal Prediction under Levy-Prokhorov Distribution Shifts: Robustness to Local and Global Perturbations
Liviu Aolaritei, Zheyu Oliver Wang, Julie Zhu, Michael I. Jordan, Youssef Marzouk
TL;DR
This work addresses the challenge of conformal prediction under distribution shifts by modeling uncertainty with Lévy–Prokhorov (LP) ambiguity sets around the training distribution. By propagating LP shifts through the nonconformity scoring function, the authors reduce complex input–label perturbations to a one-dimensional shift in score space and derive closed-form worst-case quantiles and coverage. They then construct distributionally robust conformal prediction sets with explicit dependence on LP parameters, and prove finite-sample guarantees that degrade gracefully with global perturbations while local perturbations adjust interval width. Empirical results on MNIST, ImageNet, and iWildCam demonstrate valid coverage and competitive set sizes under both synthetic and real-world shifts, with a data-driven procedure to estimate LP parameters from data. The approach offers a principled, hypothesis-light framework for robust prediction intervals that do not rely on likelihood ratios or absolute continuity and can accommodate broad, combined local-global distribution shifts.
Abstract
Conformal prediction provides a powerful framework for constructing prediction intervals with finite-sample guarantees, yet its robustness under distribution shifts remains a significant challenge. This paper addresses this limitation by modeling distribution shifts using Levy-Prokhorov (LP) ambiguity sets, which capture both local and global perturbations. We provide a self-contained overview of LP ambiguity sets and their connections to popular metrics such as Wasserstein and Total Variation. We show that the link between conformal prediction and LP ambiguity sets is a natural one: by propagating the LP ambiguity set through the scoring function, we reduce complex high-dimensional distribution shifts to manageable one-dimensional distribution shifts, enabling exact quantification of worst-case quantiles and coverage. Building on this analysis, we construct robust conformal prediction intervals that remain valid under distribution shifts, explicitly linking LP parameters to interval width and confidence levels. Experimental results on real-world datasets demonstrate the effectiveness of the proposed approach.
