Weighted Aggregation of Conformity Scores for Classification
Rui Luo, Zhixin Zhou
TL;DR
This paper extends conformal prediction for multiclass classification by aggregating multiple non-conformity score functions through optimal weight learning. By formulating weighted scores and exploring four data-splitting regimes (VFCP, EFCP, DLCP, DLCP+), it provides finite-sample validity guarantees and near-oracle efficiency, grounded in VC theory with a confirmed VC-dimension upper bound of $d+1$ for the relevant subgraph classes. Theoretical results show that, under reasonable assumptions, coverage remains at $1-oldsymbol{ u}$ while the expected prediction-set size approaches the oracle benchmark as data grow, with explicit bounds for each split strategy. Empirically, the approach yields consistently smaller, valid prediction sets compared to single-score baselines across CIFAR-10/100, and it demonstrates substantial gains when combining models, supporting the practical utility of score-function and model weighting in conformal prediction.
Abstract
Conformal prediction is a powerful framework for constructing prediction sets with valid coverage guarantees in multi-class classification. However, existing methods often rely on a single score function, which can limit their efficiency and informativeness. We propose a novel approach that combines multiple score functions to improve the performance of conformal predictors by identifying optimal weights that minimize prediction set size. Our theoretical analysis establishes a connection between the weighted score functions and subgraph classes of functions studied in Vapnik-Chervonenkis theory, providing a rigorous mathematical basis for understanding the effectiveness of the proposed method. Experiments demonstrate that our approach consistently outperforms single-score conformal predictors while maintaining valid coverage, offering a principled and data-driven way to enhance the efficiency and practicality of conformal prediction in classification tasks.
