Provably Robust Conformal Prediction with Improved Efficiency
Ge Yan, Yaniv Romano, Tsui-Wei Weng
TL;DR
This work addresses the vulnerability of conformal prediction to adversarial perturbations and the associated efficiency drawbacks. It introduces RSCP+, a framework that provides provable robustness guarantees for conformal prediction under bounded adversarial noise, and two practical methods, Post-Training Transformation (PTT) and Robust Conformal Training (RCT), to reduce prediction-set size with minimal overhead. RSCP+ replaces the problematic Monte Carlo-based robustness step with a Hoeffding-based bound on the Monte Carlo score and uses a calibrated threshold to maintain coverage, while PTT and RCT empirically shrink the prediction sets on CIFAR-10/100 and ImageNet without sacrificing robustness. The resulting approach yields substantial empirical gains in efficiency (up to 4.36× on CIFAR-10, 5.46× on CIFAR-100, and 16.9× on ImageNet) and provides a publicly available implementation, making robust conformal prediction more practical for large-scale, real-world scenarios.
Abstract
Conformal prediction is a powerful tool to generate uncertainty sets with guaranteed coverage using any predictive model, under the assumption that the training and test data are i.i.d.. Recently, it has been shown that adversarial examples are able to manipulate conformal methods to construct prediction sets with invalid coverage rates, as the i.i.d. assumption is violated. To address this issue, a recent work, Randomized Smoothed Conformal Prediction (RSCP), was first proposed to certify the robustness of conformal prediction methods to adversarial noise. However, RSCP has two major limitations: (i) its robustness guarantee is flawed when used in practice and (ii) it tends to produce large uncertainty sets. To address these limitations, we first propose a novel framework called RSCP+ to provide provable robustness guarantee in evaluation, which fixes the issues in the original RSCP method. Next, we propose two novel methods, Post-Training Transformation (PTT) and Robust Conformal Training (RCT), to effectively reduce prediction set size with little computation overhead. Experimental results in CIFAR10, CIFAR100, and ImageNet suggest the baseline method only yields trivial predictions including full label set, while our methods could boost the efficiency by up to $4.36\times$, $5.46\times$, and $16.9\times$ respectively and provide practical robustness guarantee. Our codes are available at https://github.com/Trustworthy-ML-Lab/Provably-Robust-Conformal-Prediction.
