Towards Precise Prediction Uncertainty in GNNs: Refining GNNs with Topology-grouping Strategy
Hyunjin Seo, Kyusung Seo, Joonhyung Park, Eunho Yang
TL;DR
This work addresses the miscalibration challenge in graph neural networks by demonstrating that calibration errors cannot be captured by a single trend based solely on neighborhood similarity. It introduces Simi-Mailbox, a post-hoc calibration method that groups nodes by both neighborhood similarity and individual confidence, applying per-group temperatures to calibrate predictions without relying on proximity or connectivity. Through extensive experiments across small, medium, and large graphs, Simi-Mailbox achieves up to 13.79% error reduction in calibration error (ECE) and consistently outperforms strong baselines across various backbones and settings, including heterophilous graphs and self-training. The approach preserves accuracy while improving reliability, offering a scalable and robust solution for uncertainty quantification in GNNs with practical impact on downstream decision-making.
Abstract
Recent advancements in graph neural networks (GNNs) have highlighted the critical need of calibrating model predictions, with neighborhood prediction similarity recognized as a pivotal component. Existing studies suggest that nodes with analogous neighborhood prediction similarity often exhibit similar calibration characteristics. Building on this insight, recent approaches incorporate neighborhood similarity into node-wise temperature scaling techniques. However, our analysis reveals that this assumption does not hold universally. Calibration errors can differ significantly even among nodes with comparable neighborhood similarity, depending on their confidence levels. This necessitates a re-evaluation of existing GNN calibration methods, as a single, unified approach may lead to sub-optimal calibration. In response, we introduce **Simi-Mailbox**, a novel approach that categorizes nodes by both neighborhood similarity and their own confidence, irrespective of proximity or connectivity. Our method allows fine-grained calibration by employing *group-specific* temperature scaling, with each temperature tailored to address the specific miscalibration level of affiliated nodes, rather than adhering to a uniform trend based on neighborhood similarity. Extensive experiments demonstrate the effectiveness of our **Simi-Mailbox** across diverse datasets on different GNN architectures, achieving up to 13.79\% error reduction compared to uncalibrated GNN predictions.
