Bounded and Uniform Energy-based Out-of-distribution Detection for Graphs
Shenzhi Yang, Bin Liang, An Liu, Lin Gui, Xingkai Yao, Xiaofang Zhang
TL;DR
This paper tackles node-level OOD detection in graphs, where GNNSAFE’s energy-based score aggregation can become unreliable due to extreme, unbounded negative energies and logit shifts. The authors introduce NODESAFE, adding two regularizers, $\mathcal{L}_{bound}$ and $\mathcal{L}_{uniform}$, to bound the logit $\ell_2$ norm and reduce logit-sum variance, respectively, forming $\mathcal{L}_{UB}$ that augments the existing energy-based objective $\mathcal{L}_{OOD}$ into $\mathcal{L}_{ALL} = \mathcal{L}_{OOD} + \lambda_2 \mathcal{L}_{UB}$. The method yields strong improvements in node-level OOD detection across multiple datasets and OOD generation scenarios, with substantial reductions in FPR95 and competitive AUROC/AUPR metrics, while maintaining efficient training and faster convergence. The work advances reliable GNN-based OOD detection for security-sensitive graph applications by addressing both the upper and lower bounds of the logit-derived energies. Overall, NODESAFE provides a principled, scalable approach to stabilize energy-based OOD detection on graphs and broadens its practical impact in real-world graph domains.
Abstract
Given the critical role of graphs in real-world applications and their high-security requirements, improving the ability of graph neural networks (GNNs) to detect out-of-distribution (OOD) data is an urgent research problem. The recent work GNNSAFE proposes a framework based on the aggregation of negative energy scores that significantly improves the performance of GNNs to detect node-level OOD data. However, our study finds that score aggregation among nodes is susceptible to extreme values due to the unboundedness of the negative energy scores and logit shifts, which severely limits the accuracy of GNNs in detecting node-level OOD data. In this paper, we propose NODESAFE: reducing the generation of extreme scores of nodes by adding two optimization terms that make the negative energy scores bounded and mitigate the logit shift. Experimental results show that our approach dramatically improves the ability of GNNs to detect OOD data at the node level, e.g., in detecting OOD data induced by Structure Manipulation, the metric of FPR95 (lower is better) in scenarios without (with) OOD data exposure are reduced from the current SOTA by 28.4% (22.7%).
