Multivariate Conformal Prediction using Optimal Transport
Michal Klein, Louis Bethune, Eugene Ndiaye, Marco Cuturi
TL;DR
This work extends conformal prediction to multivariate outputs by leveraging optimal transport to define Kantorovich ranks and center-outward quantiles. It introduces OT-CP, which maps vector-valued conformity scores through an optimal transport map to a univariate score, enabling standard, distribution-free CP with finite-sample guarantees. The paper develops two practical implementations: OT-based merging using the entropic map for tractable transport estimation and a coverage-preserving scheme under approximations, with formal calibration results. Empirical evaluation on a multivariate regression benchmark shows that OT-CP often yields smaller predictive regions than baseline multivariate CP methods, albeit with higher computational cost and sensitivity to hyperparameters like the entropic regularization and sphere discretization. Overall, OT-CP provides a principled, distribution-free framework for uncertainty quantification in high-dimensional prediction tasks, expanding the applicability of conformal methods to multivariate settings.
Abstract
Conformal prediction (CP) quantifies the uncertainty of machine learning models by constructing sets of plausible outputs. These sets are constructed by leveraging a so-called conformity score, a quantity computed using the input point of interest, a prediction model, and past observations. CP sets are then obtained by evaluating the conformity score of all possible outputs, and selecting them according to the rank of their scores. Due to this ranking step, most CP approaches rely on a score functions that are univariate. The challenge in extending these scores to multivariate spaces lies in the fact that no canonical order for vectors exists. To address this, we leverage a natural extension of multivariate score ranking based on optimal transport (OT). Our method, OTCP, offers a principled framework for constructing conformal prediction sets in multidimensional settings, preserving distribution-free coverage guarantees with finite data samples. We demonstrate tangible gains in a benchmark dataset of multivariate regression problems and address computational \& statistical trade-offs that arise when estimating conformity scores through OT maps.
