LLMs as Zero-shot Graph Learners: Alignment of GNN Representations with LLM Token Embeddings
Duo Wang, Yuan Zuo, Fengzhi Li, Junjie Wu
TL;DR
TEA-GLM introduces a cross-dataset, cross-task zero-shot framework by aligning GNN representations with LLM token embeddings and mapping graph features into a fixed set of graph tokens via a linear projector. It jointly trains two contrastive objectives—instance-wise across graph views and feature-wise to LLM embeddings—enhanced by PCA-based alignment and a designed instruction paradigm that unifies node-, edge-, and graph-level tasks. Empirical results show TEA-GLM achieving state-of-the-art zero-shot performance on unseen datasets and tasks, with ablations confirming the critical roles of graph token embeddings and feature-wise alignment. This approach enables transferable graph reasoning with a fixed LLM, reducing task-specific fine-tuning while leveraging LLM knowledge for practical zero-shot graph learning.
Abstract
Zero-shot graph machine learning, especially with graph neural networks (GNNs), has garnered significant interest due to the challenge of scarce labeled data. While methods like self-supervised learning and graph prompt learning have been extensively explored, they often rely on fine-tuning with task-specific labels, limiting their effectiveness in zero-shot scenarios. Inspired by the zero-shot capabilities of instruction-fine-tuned large language models (LLMs), we introduce a novel framework named Token Embedding-Aligned Graph Language Model (TEA-GLM) that leverages LLMs as cross-dataset and cross-task zero-shot learners for graph machine learning. Concretely, we pretrain a GNN, aligning its representations with token embeddings of an LLM. We then train a linear projector that transforms the GNN's representations into a fixed number of graph token embeddings without tuning the LLM. A unified instruction is designed for various graph tasks at different levels, such as node classification (node-level) and link prediction (edge-level). These design choices collectively enhance our method's effectiveness in zero-shot learning, setting it apart from existing methods. Experiments show that our graph token embeddings help the LLM predictor achieve state-of-the-art performance on unseen datasets and tasks compared to other methods using LLMs as predictors.
