Enhancing Conformal Prediction Using E-Test Statistics

A. A. Balinsky; A. D. Balinsky

Enhancing Conformal Prediction Using E-Test Statistics

A. A. Balinsky, A. D. Balinsky

TL;DR

This work tackles uncertainty quantification in machine learning predictions via Conformal Prediction (CP) under exchangeability, and proposes enhancing CP with e-test statistics using a BB-predictor. The core theoretical contribution shows that for exchangeable non-negative scores $L_i$, the ratio $F = \frac{L_{n+1}}{(\sum_{j=1}^{n+1} L_j)/(n+1)}$ satisfies $\mathbf{E}(F)=1$ and $P(F \ge 1/\alpha) \le \alpha$, enabling a BB-predictor based bound on $L_{n+1}$ and thus a distribution-free conformal guarantee. The paper complements theory with MNIST-based experiments comparing e-test statistics to p-value statistics in inductive CP, highlighting distinct label-allocations and validating the practical utility of the BB-predictor approach. Overall, it offers a principled, distribution-free mechanism for reliable uncertainty quantification in prediction intervals, with potential impact on vision tasks and beyond, by broadening the CP toolkit beyond p-values to e-test based bounds.

Abstract

Conformal Prediction (CP) serves as a robust framework that quantifies uncertainty in predictions made by Machine Learning (ML) models. Unlike traditional point predictors, CP generates statistically valid prediction regions, also known as prediction intervals, based on the assumption of data exchangeability. Typically, the construction of conformal predictions hinges on p-values. This paper, however, ventures down an alternative path, harnessing the power of e-test statistics to augment the efficacy of conformal predictions by introducing a BB-predictor (bounded from the below predictor).

Enhancing Conformal Prediction Using E-Test Statistics

TL;DR

, the ratio

satisfies

and

, enabling a BB-predictor based bound on

and thus a distribution-free conformal guarantee. The paper complements theory with MNIST-based experiments comparing e-test statistics to p-value statistics in inductive CP, highlighting distinct label-allocations and validating the practical utility of the BB-predictor approach. Overall, it offers a principled, distribution-free mechanism for reliable uncertainty quantification in prediction intervals, with potential impact on vision tasks and beyond, by broadening the CP toolkit beyond p-values to e-test based bounds.

Abstract

Paper Structure (6 sections, 3 theorems, 16 equations, 6 figures)

This paper contains 6 sections, 3 theorems, 16 equations, 6 figures.

Introduction
Main Result
Experiments and Results
Inductive Conformal Prediction with e-test statistics
Inductive Conformal Prediction with p-value statistics
Conclusion

Key Result

Lemma 1

Suppose $L_1, \ldots, L_{n+1}$ are exchangeable random variables. Set i.e., the number of $L$ that are at least as large as the last one. Then for all $\epsilon\in [0,1]$

Figures (6)

Figure 1: The first nine images from the MNIST database of handwritten digits.
Figure 2: Our model.
Figure 3: Some images that the model predicts the wrong labels.
Figure 4: Values of the loss function on the CalibrationSet.
Figure 5: Some images that have multiple labels prediction with their labels.
...and 1 more figures

Theorems & Definitions (5)

Lemma 1
Theorem 1
proof
Corollary 1.1
Definition

Enhancing Conformal Prediction Using E-Test Statistics

TL;DR

Abstract

Enhancing Conformal Prediction Using E-Test Statistics

Authors

TL;DR

Abstract

Table of Contents

Key Result

Figures (6)

Theorems & Definitions (5)