Conformal Prediction: A Theoretical Note and Benchmarking Transductive Node Classification in Graphs
Pranav Maneriker, Aditya T. Vadlamani, Anutam Srinivasan, Yuntian He, Ali Payani, Srinivasan Parthasarathy
TL;DR
Conformal prediction (CP) provides statistically valid prediction sets with coverage guarantees for graph-based tasks under exchangeability, enabling uncertainty quantification in node classification. The paper analyzes design choices in graph CP, proves a finite-sample efficiency bound comparing randomized vs deterministic adaptive scores, and develops scalable CFGNN-based methods with batching and caching to handle large graphs. It benchmarks a wide spectrum of CP methods (TPS, TPS-Classwise, APS, RAPS, DAPS, DTPS, NAPS, CFGNN) across diverse datasets, revealing clear trade-offs between efficiency and adaptability and showing randomized APS often improves efficiency, especially with many classes. The work offers practical guidelines for method selection, a Python library for graph CP, and avenues for future work such as fairness auditing and non-IID uncertainty considerations.
Abstract
Conformal prediction has become increasingly popular for quantifying the uncertainty associated with machine learning models. Recent work in graph uncertainty quantification has built upon this approach for conformal graph prediction. The nascent nature of these explorations has led to conflicting choices for implementations, baselines, and method evaluation. In this work, we analyze the design choices made in the literature and discuss the tradeoffs associated with existing methods. Building on the existing implementations, we introduce techniques to scale existing methods to large-scale graph datasets without sacrificing performance. Our theoretical and empirical results justify our recommendations for future scholarship in graph conformal prediction.
