Minimum Volume Conformal Sets for Multivariate Regression
Sacha Braun, Liviu Aolaritei, Michael I. Jordan, Francis Bach
TL;DR
This paper tackles the problem of constructing valid multivariate predictive sets with finite-sample guarantees while minimizing their volume. It introduces Minimum-Volume Conformal Sets (MVCS), an optimization-driven framework that learns the smallest-volume region defined by arbitrary norm balls, including learning the norm parameter p and multi-norm configurations, and jointly optimizes the predictor and the uncertainty representation. The approach combines DC-based and convex-relaxation techniques to handle nonconvex volume minimization, and extends to local adaptivity and regression by using residual-based centers and covariate-dependent transformations, with conformalization to preserve coverage. Empirical results on synthetic and real datasets show MVCS yields tighter, well-calibrated prediction sets and competitive computational efficiency compared with baselines, highlighting its practical impact for reliable multivariate uncertainty quantification.
Abstract
Conformal prediction provides a principled framework for constructing predictive sets with finite-sample validity. While much of the focus has been on univariate response variables, existing multivariate methods either impose rigid geometric assumptions or rely on flexible but computationally expensive approaches that do not explicitly optimize prediction set volume. We propose an optimization-driven framework based on a novel loss function that directly learns minimum-volume covering sets while ensuring valid coverage. This formulation naturally induces a new nonconformity score for conformal prediction, which adapts to the residual distribution and covariates. Our approach optimizes over prediction sets defined by arbitrary norm balls, including single and multi-norm formulations. Additionally, by jointly optimizing both the predictive model and predictive uncertainty, we obtain prediction sets that are tight, informative, and computationally efficient, as demonstrated in our experiments on real-world datasets.
