Beyond Uncertainty Sets: Leveraging Optimal Transport to Extend Conformal Predictive Distribution to Multivariate Settings
Eugene Ndiaye
TL;DR
This work tackles extending conformal prediction (CP) from scalar to multivariate outcomes by leveraging optimal transport (OT) to define center-outward ranks and multivariate quantile regions. It introduces a conformalized vector-valued score framework where the candidate’s rank is defined via an augmented OT map, ensuring exact finite-sample, distribution-free coverage. A key technical contribution is a tractable algorithm that precomputes a fixed polyhedral partition of the score space, enabling efficient prediction-set construction without solving OT for every candidate. Building on this backbone, the authors construct multivariate conformal predictive distributions (CPDs) with finite-sample calibration, including randomized variants that generalize the Dempster–Hill procedure to higher dimensions. Collectively, the paper provides distribution-free multivariate predictive distributions and calibrated prediction regions, advancing principled uncertainty quantification for multivariate tasks with practical, scalable algorithms.
Abstract
Conformal prediction (CP) constructs uncertainty sets for model outputs with finite-sample coverage guarantees. A candidate output is included in the prediction set if its non-conformity score is not considered extreme relative to the scores observed on a set of calibration examples. However, this procedure is only straightforward when scores are scalar-valued, which has limited CP to real-valued scores or ad-hoc reductions to one dimension. The problem of ordering vectors has been studied via optimal transport (OT), which provides a principled method for defining vector-ranks and multivariate quantile regions, though typically with only asymptotic coverage guarantees. We restore finite-sample, distribution-free coverage by conformalizing the vector-valued OT quantile region. Here, a candidate's rank is defined via a transport map computed for the calibration scores augmented with that candidate's score. This defines a continuum of OT problems for which we prove that the resulting optimal assignment is piecewise-constant across a fixed polyhedral partition of the score space. This allows us to characterize the entire prediction set tractably, and provides the machinery to address a deeper limitation of prediction sets: that they only indicate which outcomes are plausible, but not their relative likelihood. In one dimension, conformal predictive distributions (CPDs) fill this gap by producing a predictive distribution with finite-sample calibration. Extending CPDs beyond one dimension remained an open problem. We construct, to our knowledge, the first multivariate CPDs with finite-sample calibration, i.e., they define a valid multivariate distribution where any derived uncertainty region automatically has guaranteed coverage. We present both conservative and exact randomized versions, the latter resulting in a multivariate generalization of the classical Dempster-Hill procedure.
