Robust Yet Efficient Conformal Prediction Sets
Soroush H. Zargarbashi, Mohammad Sadegh Akhondzadeh, Aleksandar Bojchevski
TL;DR
This work tackles the vulnerability of conformal prediction (CP) to adversarial evasion and calibration-data poisoning by introducing CDF-Aware Sets (CAS). CAS derives provable robustness using randomized smoothing to bound worst-case conformity-score changes under perturbations and leverages the score's CDF to obtain tighter upper bounds, improving efficiency over prior certificates. It extends robustness guarantees to both feature and label poisoning and to discrete, sparse data, while incorporating finite-sample corrections for practical deployment. The calibration-time certificate further reduces computational cost and set sizes, enabling scalable robust CP on large datasets and graphs. Overall, CAS provides provably robust yet efficient prediction sets that preserve CP’s distribution-free guarantees in adversarial environments, with broad applicability across modalities and data types.
Abstract
Conformal prediction (CP) can convert any model's output into prediction sets guaranteed to include the true label with any user-specified probability. However, same as the model itself, CP is vulnerable to adversarial test examples (evasion) and perturbed calibration data (poisoning). We derive provably robust sets by bounding the worst-case change in conformity scores. Our tighter bounds lead to more efficient sets. We cover both continuous and discrete (sparse) data and our guarantees work both for evasion and poisoning attacks (on both features and labels).
