Robust Conformal Prediction Using Privileged Information
Shai Feldman, Yaniv Romano
TL;DR
This work addresses uncertainty quantification under corrupted training data by introducing Privileged Conformal Prediction (PCP), which leverages privileged information available only during training to correct distribution shift caused by corruption. PCP builds on weighted conformal prediction but avoids requiring test-time PI by computing a PI-informed, conservative threshold that guarantees marginal coverage: $\mathbb{P}(Y^{test} \in C^{PCP}(X^{test})) \ge 1-\alpha$. The method includes a scarce-data variant (LOO-PCP) and demonstrates strong empirical performance across causal inference (IHDP), missing response, and noisy label scenarios, achieving valid coverage with informative prediction sets. Overall, PCP provides a theoretically grounded, practical calibration scheme for robust uncertainty quantification in the presence of training-time corruptions, with broad applicability and public software support.
Abstract
We develop a method to generate prediction sets with a guaranteed coverage rate that is robust to corruptions in the training data, such as missing or noisy variables. Our approach builds on conformal prediction, a powerful framework to construct prediction sets that are valid under the i.i.d assumption. Importantly, naively applying conformal prediction does not provide reliable predictions in this setting, due to the distribution shift induced by the corruptions. To account for the distribution shift, we assume access to privileged information (PI). The PI is formulated as additional features that explain the distribution shift, however, they are only available during training and absent at test time. We approach this problem by introducing a novel generalization of weighted conformal prediction and support our method with theoretical coverage guarantees. Empirical experiments on both real and synthetic datasets indicate that our approach achieves a valid coverage rate and constructs more informative predictions compared to existing methods, which are not supported by theoretical guarantees.
