Conformal Prediction for Ensembles: Improving Efficiency via Score-Based Aggregation

Eduardo Ochoa Rivera; Yash Patel; Ambuj Tewari

Conformal Prediction for Ensembles: Improving Efficiency via Score-Based Aggregation

Eduardo Ochoa Rivera, Yash Patel, Ambuj Tewari

TL;DR

This work tackles distribution-free uncertainty quantification for ensembles by extending conformal prediction to multivariate score space. It introduces Conformal Score Aggregation (CSA), which constructs a data-driven, convex, score-space quantile envelope using a nested family of score-frontier sets and a two-stage calibration to preserve exchangeability. CSA yields more informative prediction regions than naive aggregation while maintaining formal coverage, demonstrated across ImageNet classification, OpenML regression, and a predict-then-optimize traffic routing task. The approach offers a practical, scalable framework for uncertainty estimation in multi-modal ensembles with downstream decision-making implications.

Abstract

Distribution-free uncertainty estimation for ensemble methods is increasingly desirable due to the widening deployment of multi-modal black-box predictive models. Conformal prediction is one approach that avoids such distributional assumptions. Methods for conformal aggregation have in turn been proposed for ensembled prediction, where the prediction regions of individual models are merged as to retain coverage guarantees while minimizing conservatism. Merging the prediction regions directly, however, sacrifices structures present in the conformal scores that can further reduce conservatism. We, therefore, propose a novel framework that extends the standard scalar formulation of a score function to a multivariate score that produces more efficient prediction regions. We then demonstrate that such a framework can be efficiently leveraged in both classification and predict-then-optimize regression settings downstream and empirically show the advantage over alternate conformal aggregation methods.

Conformal Prediction for Ensembles: Improving Efficiency via Score-Based Aggregation

TL;DR

Abstract

Paper Structure (31 sections, 2 theorems, 16 equations, 10 figures, 8 tables, 2 algorithms)

This paper contains 31 sections, 2 theorems, 16 equations, 10 figures, 8 tables, 2 algorithms.

Introduction
Background
Conformal Prediction
Quantile Envelopes
Predict-Then-Optimize
Related Works
Method
Multivariate Score Quantile
Score Partial Ordering
Score Quantile Threshold
Predict-Then-Optimize
Experiments
Classification Tasks
Regression Tasks
CSA Predict-Then-Optimize
...and 16 more sections

Key Result

Theorem 3.1

Suppose $\mathcal{D}_{\mathcal{C}} := \{(X_i,Y_i)\}_{i=1}^{N_{\mathcal{C}}}$ and $(X',Y')$ are exchangeable. Assume further that $K$ maps $s_{k} : \mathcal{X}\times\mathcal{Y}\rightarrow\mathbb{R}$ have been defined and a composite $s(X,Y) := (s_1(X,Y),...,s_{K}(X,Y))$ is defined. Further denote by

Figures (10)

Figure 1: CSA provides a principled extension to the standard conformal prediction pipeline by leveraging ideas from higher-dimensional quantile regression to define quantile envelopes $\widehat{\mathcal{Q}}$ instead of scalar quantiles $\widehat{q}$. It does so by evaluating a collection of score functions (here $s_1$ and $s_2$) over the calibration dataset to define $\mathcal{S}$, finding quantiles $\{\widehat{q}_{m}\}$ over a set of projection directions $\{u_{m}\}$, and taking $\widehat{\mathcal{Q}}$ to be the intersection of the resulting half-planes $H(u_{m},\widehat{q}_{m})$. These quantile envelopes result in more informative prediction regions that can be used in downstream tasks.
Figure 2: The calibration score evaluations are first split between those used to define the pre-ordering (green) $\mathcal{S}_C^{(1)}$ and those used to define the final multivariate quantile (red) $\mathcal{S}_C^{(2)}$.
Figure 3: The pre-ordering points are projected across a number of directions, after which the $\beta$ quantile is used to define a direction quantile. This defines a half-plane of points that are in the region (blue) and those outside (red).
Figure 4: We use the intersection of hyperplanes to define the quantile envelope, seeking $\beta^{*}$ that achieves the desired coverage.
Figure 5: Using the quantile envelope, the family of nested sets $\mathcal{A}_t$ is defined, in turn defining a partial ordering over $\mathbb{R}^{K}$.
...and 5 more figures

Theorems & Definitions (3)

Theorem 3.1
Theorem B.1
proof

Conformal Prediction for Ensembles: Improving Efficiency via Score-Based Aggregation

TL;DR

Abstract

Conformal Prediction for Ensembles: Improving Efficiency via Score-Based Aggregation

Authors

TL;DR

Abstract

Table of Contents

Key Result

Figures (10)

Theorems & Definitions (3)