Table of Contents
Fetching ...

Both Topology and Text Matter: Revisiting LLM-guided Out-of-Distribution Detection on Text-attributed Graphs

Yinlin Zhu, Di Wu, Xu Wang, Guocong Quan, Miao Hu

TL;DR

This work tackles OOD detection in text-attributed graphs by bridging topology and textual semantics. It introduces LG-Plug, a plug-and-play framework that first aligns topology- and text-derived representations and then derives consensus-driven OOD exposure through clustered, iterative LLM prompting with a lightweight in-cluster codebook. The obtained OOD exposure acts as a regularizer, enabling seamless integration with existing topology-driven detectors and yielding consistent improvements across six TAG benchmarks, including notable FPR95 reductions and AUROC gains. The approach demonstrates that jointly modeling textual and structural information, with efficient LLM guidance, offers robust, scalable OOD detection for open-world graph applications.

Abstract

Text-attributed graphs (TAGs) associate nodes with textual attributes and graph structure, enabling GNNs to jointly model semantic and structural information. While effective on in-distribution (ID) data, GNNs often encounter out-of-distribution (OOD) nodes with unseen textual or structural patterns in real-world settings, leading to overconfident and erroneous predictions in the absence of reliable OOD detection. Early approaches address this issue from a topology-driven perspective, leveraging neighboring structures to mitigate node-level detection bias. However, these methods typically encode node texts as shallow vector features, failing to fully exploit rich semantic information. In contrast, recent LLM-based approaches generate pseudo OOD priors by leveraging textual knowledge, but they suffer from several limitations: (1) a reliability-informativeness imbalance in the synthesized OOD priors, as the generated OOD exposures either deviate from the true OOD semantics, or introduce non-negligible ID noise, all of which offers limited improvement to detection performance; (2) reliance on specialized architectures, which prevents incorporation of the extensive effective topology-level insights that have been empirically validated in prior work. To this end, we propose LG-Plug, an LLM-Guided Plug-and-play strategy for TAG OOD detection tasks. LG-Plug aligns topology and text representations to produce fine-grained node embeddings, then generates consensus-driven OOD exposure via clustered iterative LLM prompting. Moreover, it leverages lightweight in-cluster codebook and heuristic sampling reduce time cost of LLM querying. The resulting OOD exposure serves as a regularization term to separate ID and OOD nodes, enabling seamless integration with existing detectors.

Both Topology and Text Matter: Revisiting LLM-guided Out-of-Distribution Detection on Text-attributed Graphs

TL;DR

This work tackles OOD detection in text-attributed graphs by bridging topology and textual semantics. It introduces LG-Plug, a plug-and-play framework that first aligns topology- and text-derived representations and then derives consensus-driven OOD exposure through clustered, iterative LLM prompting with a lightweight in-cluster codebook. The obtained OOD exposure acts as a regularizer, enabling seamless integration with existing topology-driven detectors and yielding consistent improvements across six TAG benchmarks, including notable FPR95 reductions and AUROC gains. The approach demonstrates that jointly modeling textual and structural information, with efficient LLM guidance, offers robust, scalable OOD detection for open-world graph applications.

Abstract

Text-attributed graphs (TAGs) associate nodes with textual attributes and graph structure, enabling GNNs to jointly model semantic and structural information. While effective on in-distribution (ID) data, GNNs often encounter out-of-distribution (OOD) nodes with unseen textual or structural patterns in real-world settings, leading to overconfident and erroneous predictions in the absence of reliable OOD detection. Early approaches address this issue from a topology-driven perspective, leveraging neighboring structures to mitigate node-level detection bias. However, these methods typically encode node texts as shallow vector features, failing to fully exploit rich semantic information. In contrast, recent LLM-based approaches generate pseudo OOD priors by leveraging textual knowledge, but they suffer from several limitations: (1) a reliability-informativeness imbalance in the synthesized OOD priors, as the generated OOD exposures either deviate from the true OOD semantics, or introduce non-negligible ID noise, all of which offers limited improvement to detection performance; (2) reliance on specialized architectures, which prevents incorporation of the extensive effective topology-level insights that have been empirically validated in prior work. To this end, we propose LG-Plug, an LLM-Guided Plug-and-play strategy for TAG OOD detection tasks. LG-Plug aligns topology and text representations to produce fine-grained node embeddings, then generates consensus-driven OOD exposure via clustered iterative LLM prompting. Moreover, it leverages lightweight in-cluster codebook and heuristic sampling reduce time cost of LLM querying. The resulting OOD exposure serves as a regularization term to separate ID and OOD nodes, enabling seamless integration with existing detectors.
Paper Structure (21 sections, 16 equations, 5 figures, 4 tables, 1 algorithm)

This paper contains 21 sections, 16 equations, 5 figures, 4 tables, 1 algorithm.

Figures (5)

  • Figure 1: Illustration of the reliability-informativeness imbalance in LLM-based OOD prior generation: (a) Realistic ID and OOD categories on Cora Yang16cora. (b) Reliability-oriented methods avoid ID overlap but yield semantically unrealistic priors, limiting informativeness. (c) Informativeness-oriented methods produce realistic OOD priors by labeling unlabeled nodes but risk mislabeling IDs to harm detection.
  • Figure 2: Overview of the proposed LG-Plug plug-and-play framework. (a) Topology-Text Representation Alignment jointly trains a graph encoder and a text encoder to learn fine-grained and discriminative node embeddings for both ID and OOD nodes. (b) Consensus-driven OOD Exposure with LLM identifies reliable and informative OOD exposures from unlabeled nodes by prompting a LLM while remaining scalable. The extracted OOD exposures act as a regularization signal and can be seamlessly integrated into existing topology-driven graph OOD detectors.
  • Figure 3: Distribution of negative energy scores on the Cora dataset. The dashed line indicates the threshold for a 95% true positive rate (TPR).
  • Figure 4: Sensitivity analysis for the number of clusters (corresponding to Eq. (\ref{['eq: clustering']})) on the Cora dataset.
  • Figure 5: Word cloud visualizations on the Cora dataset. (a)-(g) show ground-truth ID and OOD categories, while (h) displays all the OOD nodes. (i)-(l) denotes OOD exposure from our LG-Plug and baselines. Stopwords from both WordCloud’s built-in list and our custom list (Appendix \ref{['appendix: stopwords']}) were removed for clarity.