Aggregating Conformal Prediction Sets via α-Allocation
Congbin Xu, Yue Yu, Haojie Ren, Zhaojun Wang, Changliang Zou
TL;DR
COLA advances conformal prediction by aggregating multiple nonconformity scores through data-driven confidence-level allocation, forming an intersection of calibrated prediction sets to reduce average set size under marginal coverage. It develops four variants (COLA-e, COLA-s, COLA-f, COLA-l) balancing efficiency, finite-sample validity, full conformalization, and individualized allocation, with theoretical guarantees including asymptotic optimality and, for COLA-s/f, finite-sample validity. Empirical results on synthetic and real data demonstrate consistently smaller sets than state-of-the-art baselines while maintaining coverage, and COLA-l delivers further gains by tailoring allocations to test-point covariates. The work offers practical, scalable strategies for multi-score conformal aggregation and opens directions for improving COLA-f efficiency and exploring alternative allocation criteria in decision-making contexts.
Abstract
Conformal prediction offers a distribution-free framework for constructing prediction sets with finite-sample coverage. Yet, efficiently leveraging multiple conformity scores to reduce prediction set size remains a major open challenge. Instead of selecting a single best score, this work introduces a principled aggregation strategy, COnfidence-Level Allocation (COLA), that optimally allocates confidence levels across multiple conformal prediction sets to minimize empirical set size while maintaining provable coverage. Two variants are further developed, COLA-s and COLA-f, which guarantee finite-sample marginal coverage via sample splitting and full conformalization, respectively. In addition, we develop COLA-l, an individualized allocation strategy that promotes local size efficiency while achieving asymptotic conditional coverage. Extensive experiments on synthetic and real-world datasets demonstrate that COLA achieves considerably smaller prediction sets than state-of-the-art baselines while maintaining valid coverage.
