Towards A Universal Graph Structural Encoder
Jialin Chen, Haolan Zuo, Haoyu Peter Wang, Siqi Miao, Pan Li, Rex Ying
TL;DR
GFSE introduces a universal graph structural encoder pre-trained across multiple domains with four self-supervised objectives, leveraging a Graph Transformer with biased attention to produce expressive positional and structural encodings (PSE). By integrating relative structural information via random-walk-based encodings and SEG-WL inspired expressiveness, GFSE achieves strong cross-domain transfer, improving downstream GNNs and enabling seamless augmentation of text-attributed graphs and LLMs. Empirical results across synthetic and real-world datasets show GFSE delivers robust gains, including state-of-the-art performance in many settings and notable improvements in molecular and large-scale graph tasks. The work demonstrates the practicality of a domain-agnostic graph foundation model that reduces task-specific fine-tuning and supports integration with downstream feature encoders and language models for broad applicability.
Abstract
Recent advancements in large-scale pre-training have shown the potential to learn generalizable representations for downstream tasks. In the graph domain, however, capturing and transferring structural information across different graph domains remains challenging, primarily due to the inherent differences in topological patterns across various contexts. Additionally, most existing models struggle to capture the complexity of rich graph structures, leading to inadequate exploration of the embedding space. To address these challenges, we propose GFSE, a universal graph structural encoder designed to capture transferable structural patterns across diverse domains such as molecular graphs, social networks, and citation networks. GFSE is the first cross-domain graph structural encoder pre-trained with multiple self-supervised learning objectives. Built on a Graph Transformer, GFSE incorporates attention mechanisms informed by graph inductive bias, enabling it to encode intricate multi-level and fine-grained topological features. The pre-trained GFSE produces generic and theoretically expressive positional and structural encoding for graphs, which can be seamlessly integrated with various downstream graph feature encoders, including graph neural networks for vectorized features and Large Language Models for text-attributed graphs. Comprehensive experiments on synthetic and real-world datasets demonstrate GFSE's capability to significantly enhance the model's performance while requiring substantially less task-specific fine-tuning. Notably, GFSE achieves state-of-the-art performance in 81.6% evaluated cases, spanning diverse graph models and datasets, highlighting its potential as a powerful and versatile encoder for graph-structured data.
