Noise-Adaptive Conformal Classification with Marginal Coverage

Teresa Bortolotti; Y. X. Rachel Wang; Xin Tong; Alessandra Menafoglio; Simone Vantini; Matteo Sesia

Noise-Adaptive Conformal Classification with Marginal Coverage

Teresa Bortolotti, Y. X. Rachel Wang, Xin Tong, Alessandra Menafoglio, Simone Vantini, Matteo Sesia

TL;DR

The paper tackles conformal classification under label noise by introducing an adaptive calibration that preserves marginal coverage despite deviations from exchangeability. It derives a marginal-coverage inflation factor and develops both finite-sample ($\delta^{\mathrm{FS}}(n)$) and asymptotic ($\delta^{\mathrm{asy}}(n)$) corrections, enabling efficient, informative prediction sets. The approach leverages an estimated contamination model $T$ (and its inverse $W$) to adjust thresholds and guarantee coverage; it offers rigorous finite-sample bounds and practical asymptotic approximations based on a Generalized Brownian Bridge. Empirical results on synthetic data and real datasets CIFAR-10H and BigEarthNet demonstrate substantially improved informativeness while maintaining the desired marginal coverage, with the asymptotic method particularly robust when contamination deviates from simple models. The work provides a versatile framework for reliable uncertainty quantification in noisy-label scenarios and suggests directions for extending to regression and uncertainty in $T$ estimation.

Abstract

Conformal inference provides a rigorous statistical framework for uncertainty quantification in machine learning, enabling well-calibrated prediction sets with precise coverage guarantees for any classification model. However, its reliance on the idealized assumption of perfect data exchangeability limits its effectiveness in the presence of real-world complications, such as low-quality labels -- a widespread issue in modern large-scale data sets. This work tackles this open problem by introducing an adaptive conformal inference method capable of efficiently handling deviations from exchangeability caused by random label noise, leading to informative prediction sets with tight marginal coverage guarantees even in those challenging scenarios. We validate our method through extensive numerical experiments demonstrating its effectiveness on synthetic and real data sets, including CIFAR-10H and BigEarthNet.

Noise-Adaptive Conformal Classification with Marginal Coverage

TL;DR

Abstract

Noise-Adaptive Conformal Classification with Marginal Coverage

TL;DR

Abstract

Paper Structure

Table of Contents

Key Result

Figures (23)

Theorems & Definitions (26)