Length Optimization in Conformal Prediction

Shayan Kiyani; George Pappas; Hamed Hassani

TL;DR

This work addresses the tension between conditional validity and length efficiency in conformal prediction by introducing Conformal Prediction with Length-Optimization (CPL). The authors formulate a minimax duality framework that characterizes the optimal length through level sets of conditional densities, and propose a practical finite-sample algorithm that optimizes adaptive thresholds over a covariate shift class $\mathcal{F}$ and a structured prediction class $\mathcal{H}$. They establish strong duality in the infinite-sample setting and finite-sample guarantees under realizability or bounded complexity. Experiments on regression, text, and vision tasks show substantial length reductions in the marginal, group-conditional, and covariate-shift settings, and open-source code is available. The results indicate that CPL produces shorter prediction sets while remaining conditionally valid across these settings.

Abstract

Conditional validity and length efficiency are two crucial aspects of conformal prediction (CP). Conditional validity ensures accurate uncertainty quantification for data subpopulations, while proper length efficiency ensures that the prediction sets remain informative. Despite significant efforts to address each of these issues individually, a principled framework that reconciles these two objectives has been missing in the CP literature. In this paper, we develop Conformal Prediction with Length-Optimization (CPL), a novel and practical framework that constructs prediction sets with (near-)optimal length while ensuring conditional validity under various classes of covariate shifts, including the key cases of marginal and group-conditional coverage. In the infinite sample regime, we provide strong duality results which indicate that CPL achieves conditional validity and length optimality. In the finite sample regime, we show that CPL constructs conditionally valid prediction sets. Our extensive empirical evaluations demonstrate the superior prediction set size performance of CPL compared to state-of-the-art methods across diverse real-world and synthetic datasets in classification, regression, and large language model-based multiple choice question answering. An implementation of our algorithm can be accessed at the following link: https://github.com/shayankiyani98/CP.
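To make the two quantities in the abstract concrete, the following is a minimal sketch of standard split conformal prediction (not CPL itself) on synthetic data. It reports marginal coverage, per-group coverage, and interval length. The data-generating process, the group definition, the "pretrained" predictor, and all function names are assumptions made for illustration and do not come from the paper or its repository.

```python
import numpy as np

def split_conformal_threshold(scores, alpha):
    """Finite-sample-corrected (1 - alpha) quantile of calibration scores."""
    n = len(scores)
    k = int(np.ceil((n + 1) * (1 - alpha)))  # rank of the order statistic to use
    if k > n:
        return np.inf  # too few calibration points for this alpha
    return np.sort(scores)[k - 1]

# Synthetic regression data (assumption): noise is larger when x > 0.5.
rng = np.random.default_rng(0)
n = 4000
x = rng.uniform(0.0, 1.0, size=n)
group = x > 0.5
noise_scale = np.where(group, 2.0, 0.5)
y = 3.0 * x + noise_scale * rng.normal(size=n)
y_hat = 3.0 * x  # stand-in for a pretrained point predictor

# Calibrate one global threshold on the first half, evaluate on the second.
cal, test = slice(0, 2000), slice(2000, None)
scores = np.abs(y[cal] - y_hat[cal])  # absolute-residual conformity score
q = split_conformal_threshold(scores, alpha=0.1)

covered = np.abs(y[test] - y_hat[test]) <= q  # is y in [y_hat - q, y_hat + q]?
marg_cov = covered.mean()
cov_low = covered[~group[test]].mean()   # low-noise group
cov_high = covered[group[test]].mean()   # high-noise group
length = 2.0 * q                         # same interval length for every x

print(f"marginal coverage: {marg_cov:.3f}, length: {length:.3f}")
print(f"coverage (low noise): {cov_low:.3f}, (high noise): {cov_high:.3f}")
```

With a single global threshold, the interval length is the same for every input, so the low-noise group tends to be over-covered and the high-noise group under-covered even when marginal coverage is close to the target. That gap between marginal validity, group-conditional validity, and length is the trade-off the abstract describes.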

Paper Structure (24 sections, 17 theorems, 174 equations, 12 figures, 3 tables, 1 algorithm)

Introduction
Problem Formulation
Preliminaries on conditional validity and length of prediction sets
Problem Statement
Minimax Formulations
The Equivalent Minimax Formulation: A Duality Perspective
Relaxed Minimax Formulation using Structured Prediction Sets
Theoretical Guarantees for the Relaxed Minimax Problem
Finite Sample Setting: The Main Algorithm
Finite Sample Guarantees
Experiments
Part I: Marginal Coverage
Real-world Regression Experiment
Multiple Choice Text Data
Part II: Group-Conditional Coverage
...and 9 more sections

Key Result

Proposition 3.1

Assuming $\mathcal{D}_{Y|X}$ is continuous, the Primary Problem and the Minimax Problem are equivalent. Let $(f^*, C^*(x))$ be an optimal solution of the Minimax Problem. Then, $C^*$ is also the optimal solution of the Primary Problem. Furthermore, $C^*$ has the following form:
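The displayed form did not survive in this extract. As a hedged sketch of why a level-set form is natural here (the TL;DR refers to level sets of conditional densities), assume the minimax objective is a Lagrangian of the length-minimization problem. The notation below, with $p(y \mid x)$ the conditional density of $Y$ given $X = x$, $\alpha$ the miscoverage level, and $\operatorname{len}(C(x)) = \int_{C(x)} dy$, is an assumption and may differ from the paper's:

$$
g_\alpha(f, C) = \mathbb{E}\big[f(X)\,\{\mathbf{1}[Y \in C(X)] - (1-\alpha)\}\big] - \mathbb{E}\big[\operatorname{len}(C(X))\big].
$$

Using $\mathbb{E}\big[\mathbf{1}[Y \in C(X)] \mid X = x\big] = \int_{C(x)} p(y \mid x)\,dy$, this can be rewritten as

$$
g_\alpha(f, C) = \mathbb{E}_X\Big[\int_{C(X)} \big(f(X)\,p(y \mid X) - 1\big)\,dy\Big] - (1-\alpha)\,\mathbb{E}[f(X)].
$$

For a fixed $f$, the integral is maximized pointwise by including exactly those $y$ whose integrand is non-negative, which under these assumptions gives a level set of the conditional density:

$$
C_f(x) = \{\, y : f(x)\,p(y \mid x) \ge 1 \,\}.
$$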

Figures (12)

Figure 1: The CP pipeline.
Figure 4: Left-hand-side plot shows coverage and right-hand-side shows mean prediction set size.
Figure 5: Left-hand-side plot shows coverage and right-hand-side shows mean interval length.
Figure 6: Left-hand-side plot shows coverage and right-hand-side shows mean prediction set size. The reported values are averaged over 20 different splits of calibration data.
Figure : (a)
...and 7 more figures

Theorems & Definitions (29)

Proposition 3.1
Example 3.2: Marginal case
Example 3.3: Group-conditional case
Proposition 3.4: Variational representation
Lemma 3.5
Theorem 3.6
Definition 3.7: Realizability
Proposition 3.8
Remark 3.9
Remark 4.1
...and 19 more
