Max-Rank: Efficient Multiple Testing for Conformal Prediction
Alexander Timans, Christoph-Nikolas Straehle, Kaspar Sakmann, Christian A. Naesseth, Eric Nalisnick
TL;DR
This work tackles the challenge of multiple testing in conformal prediction (CP) by introducing max-rank, a rank-based, resampling-inspired correction that aggregates information across tests via an $\ell^{\infty}$-norm in rank space to control the family-wise error rate at level $\alpha$. The approach links to Westfall–Young corrections and copula-based formulations, providing a theoretical guarantee of FWER control and potentially tighter thresholds than Bonferroni under positive dependencies. Empirically, max-rank delivers valid CP coverage with narrower prediction intervals and faster runtimes than copula-based alternatives across multi-target regression and conformal object detection tasks. Overall, max-rank extends the CP toolkit for reliable uncertainty quantification in settings with many parallel tests by leveraging rank-order dependencies without imposing extra CP assumptions.
Abstract
Multiple hypothesis testing (MHT) frequently arises in scientific inquiries, and concurrent testing of multiple hypotheses inflates the risk of Type-I errors or false positives, rendering MHT corrections essential. This paper addresses MHT in the context of conformal prediction, a flexible framework for predictive uncertainty quantification. Some conformal applications give rise to simultaneous testing, and positive dependencies among tests typically exist. We introduce $\texttt{max-rank}$, a novel correction that exploits these dependencies whilst efficiently controlling the family-wise error rate. Inspired by existing permutation-based corrections, $\texttt{max-rank}$ leverages rank order information to improve performance and integrates readily with any conformal procedure. We establish its theoretical and empirical advantages over the common Bonferroni correction and its compatibility with conformal prediction, highlighting the potential to strengthen predictive uncertainty estimates.
