Neural Optimal Transport Meets Multivariate Conformal Prediction
Vladimir Kondratyev, Alexander Fishkov, Nikita Kotelevskii, Mahmoud Hegazy, Remi Flamary, Maxim Panov, Eric Moulines
TL;DR
This work tackles conditional multivariate distribution estimation by marrying neural optimal transport with vector quantile regression, enabling geometry-aware conditional quantiles via a convex-potential map learned with ICNNs. The learned CVQF yields multivariate ranks that feed into conformal prediction, producing distribution-free predictive regions that adapt to the conditional geometry of Y|X. The authors introduce amortized optimization to accelerate training and inference, and explore entropic and non-entropic variants, achieving tighter, more informative CP regions than coordinatewise or density-only approaches. Empirical results on synthetic and real multi-target datasets demonstrate improved coverage-utility trade-offs and scalability to high-dimensional targets. Overall, the framework offers a principled, scalable path to multivariate uncertainty quantification with robust finite-sample guarantees and geometry-aware predictive sets.
Abstract
We propose a framework for conditional vector quantile regression (CVQR) that combines neural optimal transport with amortized optimization, and apply it to multivariate conformal prediction. Classical quantile regression does not extend naturally to multivariate responses, while existing approaches often ignore the geometry of joint distributions. Our method parametrizes the conditional vector quantile function as the gradient of a convex potential implemented by an input-convex neural network, ensuring monotonicity and uniform ranks. To reduce the cost of solving high-dimensional variational problems, we introduced amortized optimization of the dual potentials, yielding efficient training and faster inference. We then exploit the induced multivariate ranks for conformal prediction, constructing distribution-free predictive regions with finite-sample validity. Unlike coordinatewise methods, our approach adapts to the geometry of the conditional distribution, producing tighter and more informative regions. Experiments on benchmark datasets show improved coverage-efficiency trade-offs compared to baselines, highlighting the benefits of integrating neural optimal transport with conformal prediction.
