Conformal Prediction with Learned Features
Shayan Kiyani, George Pappas, Hamed Hassani
TL;DR
This work addresses the challenge of achieving conditional coverage in conformal prediction by learning uncertainty-driven partitions from calibration data (PLCP). The method optimizes a joint objective over region-wise quantiles and a partitioning function, allowing alternating gradient-descent training on standard models, and introduces MSCE as a measure of conditional miscoverage. Theoretical results provide finite- and infinite-data guarantees that quantify the tradeoffs between the number of regions, sample size, and model class complexity, and translate these into practical fallback coverage guarantees. Empirically, PLCP improves conditional coverage and maintains shorter intervals across diverse ID and OOD datasets, outperforming marginal methods and matching or surpassing baselines that rely on predefined structures. The approach offers a scalable, data-driven path to more trustworthy uncertainty quantification with potential impact in critical applications such as healthcare.
Abstract
In this paper, we focus on the problem of conformal prediction with conditional guarantees. Prior work has shown that it is impossible to construct nontrivial prediction sets with full conditional coverage guarantees. A wealth of research has considered relaxations of full conditional guarantees, relying on some predefined uncertainty structures. Departing from this line of thinking, we propose Partition Learning Conformal Prediction (PLCP), a framework to improve conditional validity of prediction sets through learning uncertainty-guided features from the calibration data. We implement PLCP efficiently with alternating gradient descent, utilizing off-the-shelf machine learning models. We further analyze PLCP theoretically and provide conditional guarantees for infinite and finite sample sizes. Finally, our experimental results over four real-world and synthetic datasets show the superior performance of PLCP compared to state-of-the-art methods in terms of coverage and length in both classification and regression scenarios.
