Distribution Free Prediction Sets for Node Classification
Jase Clarkson
TL;DR
This work addresses the lack of calibrated uncertainty for node classification in graphs by extending conformal prediction to non-exchangeable data using graph-aware weighting. It introduces Neighbourhood Adaptive Prediction Sets (NAPS), which calibrate prediction sets within a node's local neighborhood and two weighted variants (NAPS-H, NAPS-G) to balance sample size and similarity. The approach comes with theoretical guarantees on coverage and is shown to produce tighter, better-calibrated sets than naive APS across multiple GNN architectures and standard datasets, with supporting analysis in a Stochastic Block Model setting. The method enables reliable inductive uncertainty quantification for graph-structured data and can be extended to heterophilic graphs, weighted/directed networks, and node regression tasks.
Abstract
Graph Neural Networks (GNNs) are able to achieve high classification accuracy on many important real world datasets, but provide no rigorous notion of predictive uncertainty. Quantifying the confidence of GNN models is difficult due to the dependence between datapoints induced by the graph structure. We leverage recent advances in conformal prediction to construct prediction sets for node classification in inductive learning scenarios. We do this by taking an existing approach for conformal classification that relies on \textit{exchangeable} data and modifying it by appropriately weighting the conformal scores to reflect the network structure. We show through experiments on standard benchmark datasets using popular GNN models that our approach provides tighter and better calibrated prediction sets than a naive application of conformal prediction.
