A Survey on Self-Supervised Graph Foundation Models: Knowledge-Based Perspective
Ziwen Zhao, Yixin Su, Yuhua Li, Yixiong Zou, Ruixuan Li, Rui Zhang
TL;DR
This survey introduces a knowledge-based taxonomy for self-supervised graph foundation models (GFMs), organizing graph knowledge into microscopic, mesoscopic, and macroscopic categories and mapping them to over $25$ pretext tasks across GNN/GT/GLM architectures. It unifies pre-training and downstream tuning under a common framework, highlights representative methods, and analyzes efficiency, scalability, and applicability. The work also surveys self-supervised graph language models, detailing how GLMs integrate graph knowledge through pre-training and how prompting and PEFT enable efficient downstream adaptation. Challenging directions include combining knowledge patterns, cross-graph-type adaptation, bias robustness, and explainable reasoning with RoG and RAG approaches, signaling a roadmap for robust, scalable, and interpretable GFMs. Collectively, the taxonomy and syntheses offer a foundation for designing generalized GFMs that leverage graph-specific knowledge across diverse data modalities and tasks.
Abstract
Graph self-supervised learning (SSL) is now a go-to method for pre-training graph foundation models (GFMs). There is a wide variety of knowledge patterns embedded in the graph data, such as node properties and clusters, which are crucial to learning generalized representations for GFMs. However, existing surveys of GFMs have several shortcomings: they lack comprehensiveness regarding the most recent progress, have unclear categorization of self-supervised methods, and take a limited architecture-based perspective that is restricted to only certain types of graph models. As the ultimate goal of GFMs is to learn generalized graph knowledge, we provide a comprehensive survey of self-supervised GFMs from a novel knowledge-based perspective. We propose a knowledge-based taxonomy, which categorizes self-supervised graph models by the specific graph knowledge utilized. Our taxonomy consists of microscopic (nodes, links, etc.), mesoscopic (context, clusters, etc.), and macroscopic knowledge (global structure, manifolds, etc.). It covers a total of 9 knowledge categories and more than 25 pretext tasks for pre-training GFMs, as well as various downstream task generalization strategies. Such a knowledge-based taxonomy allows us to re-examine graph models based on new architectures more clearly, such as graph language models, as well as provide more in-depth insights for constructing GFMs.
