Large Language Models Meet Graph Neural Networks: A Perspective of Graph Mining
Yuxin You, Zhen Liu, Xiangchao Wen, Yongtao Zhang, Wei Ai
TL;DR
This paper addresses generalization gaps in graph mining by examining how large language models can enrich graph-structured data. It introduces a three-way taxonomy—GNN-driving-LLM, LLM-driving-GNN, and GNN-LLM-co-driving—and surveys techniques across prompting, encoding, fine-tuning, and joint architectures. The review synthesizes representative works (e.g., GraphFormers, GPT4Graph, GLEM, GRENADE) and analyzes trade-offs between semantic richness and computational cost, noting modest yet consistent gains over baselines. It highlights promising directions in multimodal graph processing, hallucination mitigation, and tackling complex graph tasks to broaden the impact of graph mining.
Abstract
Graph mining is an important area in data mining and machine learning that involves extracting valuable information from graph-structured data. In recent years, significant progress has been made in this field through the development of graph neural networks (GNNs). However, GNNs are still deficient in generalizing to diverse graph data. Aiming to this issue, Large Language Models (LLMs) could provide new solutions for graph mining tasks with their superior semantic understanding. In this review, we systematically review the combination and application techniques of LLMs and GNNs and present a novel taxonomy for research in this interdisciplinary field, which involves three main categories: GNN-driving-LLM, LLM-driving-GNN, and GNN-LLM-co-driving. Within this framework, we reveal the capabilities of LLMs in enhancing graph feature extraction as well as improving the effectiveness of downstream tasks such as node classification, link prediction, and community detection. Although LLMs have demonstrated their great potential in handling graph-structured data, their high computational requirements and complexity remain challenges. Future research needs to continue to explore how to efficiently fuse LLMs and GNNs to achieve more powerful graph learning and reasoning capabilities and provide new impetus for the development of graph mining techniques.
