Automatically Adaptive Conformal Risk Control
Vincent Blot, Anastasios N. Angelopoulos, Michael I. Jordan, Nicolas J-B Brunel
TL;DR
This work introduces automatically adaptive conformal risk control (AA-CRC), a framework that extends conformal risk control so that the uncertainty adapts to the difficulty of each input by learning an automatic conditioning function class. Building on conformal prediction and the multiaccuracy perspective, it provides conditional risk guarantees that are robust to covariate shifts represented through an embedding space or a group structure, and it also accommodates label-conditional coverage. Theoretical results connect a tilted bound on the risk $\mathbb{E}[\ell(Y_{n+1},\mathcal{C}_{\lambda}(X_{n+1}))]$ to the choice of $\lambda$ and a regularizer, enabling efficient computation via surrogate objectives $\tilde{J}(\lambda)$. Empirically, AA-CRC improves precision at fixed recall in semantic segmentation (polyp and fire datasets) by learning adaptive thresholds from embeddings or random-forest leaves, illustrating practical gains in uncertainty quantification for black-box models under controlled risk. The approach offers a scalable, data-driven path to automatic conditional guarantees in high-dimensional and structured prediction tasks.
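To make the baseline concrete, here is a minimal sketch (not the paper's implementation) of marginal conformal risk control: given per-example losses on a calibration set that are non-increasing in the threshold $\lambda$, it returns the smallest $\lambda$ whose adjusted empirical risk stays below the target level $\alpha$. The function and argument names (`crc_threshold`, `loss_matrix`, `B`) are illustrative assumptions; AA-CRC replaces the single scalar $\lambda$ with a function of the input drawn from a learned conditioning class.

```python
import numpy as np

def crc_threshold(loss_matrix, lambdas, alpha, B=1.0):
    """Smallest lambda whose adjusted empirical risk is at most alpha.

    loss_matrix[i, j] : loss of calibration example i at threshold lambdas[j],
                        assumed bounded by B and non-increasing in lambda.
    lambdas           : candidate thresholds, sorted in increasing order.
    """
    n = loss_matrix.shape[0]
    for j, lam in enumerate(lambdas):
        # CRC-style adjustment: (n * empirical risk + B) / (n + 1) <= alpha
        if (n * loss_matrix[:, j].mean() + B) / (n + 1) <= alpha:
            return lam
    return lambdas[-1]  # conservative fallback if no candidate qualifies
```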
Abstract
Science and technology have a growing need for effective mechanisms that ensure reliable, controlled performance from black-box machine learning algorithms. These performance guarantees should ideally hold conditionally on the input; that is, they should hold, at least approximately, no matter what the input is. However, beyond stylized discrete groupings such as ethnicity and gender, the right notion of conditioning can be difficult to define. For example, in problems such as image segmentation, we want the uncertainty to reflect the intrinsic difficulty of the test sample, but this may be difficult to capture via a conditioning event. Building on the recent work of Gibbs et al. [2023], we propose a methodology for achieving approximate conditional control of statistical risks (the expected values of loss functions) by adapting to the difficulty of test samples. Our framework goes beyond traditional conditional risk control, which relies on user-provided conditioning events, to the algorithmic, data-driven determination of appropriate function classes for conditioning. We apply this framework to various regression and segmentation tasks, enabling finer-grained control over model performance and demonstrating that, by continuously monitoring and adjusting these parameters, we can achieve superior precision compared to conventional risk-control methods.
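As one illustration of how conditioning can be determined from data rather than specified by hand, the sketch below (an illustrative construction, not the authors' code) partitions a calibration set by the leaves of a shallow regression tree fit on embedding features and then calibrates a separate risk-controlling threshold within each leaf. The helper names and the choice of "difficulty" target used to grow the tree are assumptions made for this example.

```python
import numpy as np
from sklearn.tree import DecisionTreeRegressor

def leafwise_thresholds(embeddings, loss_fn, lambdas, alpha, max_leaf_nodes=8):
    """Illustrative sketch: calibrate one risk-controlling threshold per tree leaf.

    embeddings : (n, d) calibration features (e.g., image embeddings).
    loss_fn    : loss_fn(lam) -> (n,) per-example losses, bounded by 1 and
                 non-increasing in lam.
    lambdas    : candidate thresholds, sorted in increasing order.
    """
    # Use the loss at a mid-range lambda as a rough "difficulty" signal for
    # growing the tree; this is an illustrative choice, not the paper's.
    difficulty = loss_fn(lambdas[len(lambdas) // 2])
    tree = DecisionTreeRegressor(max_leaf_nodes=max_leaf_nodes)
    tree.fit(embeddings, difficulty)
    leaves = tree.apply(embeddings)

    thresholds = {}
    for leaf in np.unique(leaves):
        idx = leaves == leaf
        n_leaf = int(idx.sum())
        lam_hat = lambdas[-1]  # conservative fallback for sparse leaves
        for lam in lambdas:
            risk = loss_fn(lam)[idx].mean()
            if (n_leaf * risk + 1.0) / (n_leaf + 1) <= alpha:
                lam_hat = lam
                break
        thresholds[leaf] = lam_hat
    return tree, thresholds

# At test time, the threshold for a new input is the one attached to the leaf
# its embedding falls into: lam_x = thresholds[tree.apply(x_emb.reshape(1, -1))[0]]
```

In this toy version, harder regions of the embedding space receive more conservative thresholds while easier regions receive tighter ones, which is the adaptive behavior the abstract describes; the learned conditioning classes in AA-CRC generalize this kind of partition-based construction.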
