Conformal Prediction for Ensembles: Improving Efficiency via Score-Based Aggregation
Eduardo Ochoa Rivera, Yash Patel, Ambuj Tewari
TL;DR
This work tackles distribution-free uncertainty quantification for ensembles by extending conformal prediction to multivariate score space. It introduces Conformal Score Aggregation (CSA), which constructs a data-driven, convex, score-space quantile envelope using a nested family of score-frontier sets and a two-stage calibration to preserve exchangeability. CSA yields more informative prediction regions than naive aggregation while maintaining formal coverage, demonstrated across ImageNet classification, OpenML regression, and a predict-then-optimize traffic routing task. The approach offers a practical, scalable framework for uncertainty estimation in multi-modal ensembles with downstream decision-making implications.
Abstract
Distribution-free uncertainty estimation for ensemble methods is increasingly desirable due to the widening deployment of multi-modal black-box predictive models. Conformal prediction is one approach that avoids such distributional assumptions. Methods for conformal aggregation have in turn been proposed for ensembled prediction, where the prediction regions of individual models are merged as to retain coverage guarantees while minimizing conservatism. Merging the prediction regions directly, however, sacrifices structures present in the conformal scores that can further reduce conservatism. We, therefore, propose a novel framework that extends the standard scalar formulation of a score function to a multivariate score that produces more efficient prediction regions. We then demonstrate that such a framework can be efficiently leveraged in both classification and predict-then-optimize regression settings downstream and empirically show the advantage over alternate conformal aggregation methods.
