Towards Graph Foundation Models: Learning Generalities Across Graphs via Task-Trees
Zehong Wang, Zheyuan Zhang, Tianyi Ma, Nitesh V Chawla, Chuxu Zhang, Yanfang Ye
TL;DR
This work tackles cross-task generalization in graphs by introducing task-trees, a unified learning unit that aligns node-, edge-, and graph-level tasks. The authors provide a theoretical foundation—stability, transferability, and generalization bounds—for pretraining on task-trees and demonstrate a graph foundation model, GIT, pretrained on diverse task-trees to generalize across 32 graphs in five domains. They further show that domain specialization via instruction tuning can match or surpass domain-specific experts, while maintaining broad generalization capabilities. Empirically, GIT-G and GIT-S excel under fine-tuning, in-context learning, and zero-shot settings, and task-trees prove more efficient and effective than subgraphs for cross-domain learning. The work suggests a scalable, principled route to general-purpose graph reasoning with practical benefits for real-world heterogeneous graph data.
Abstract
Foundation models are pretrained on large-scale corpora to learn generalizable patterns across domains and tasks -- such as contours, textures, and edges in images, or tokens and sentences in text. In contrast, discovering such generalities in graph-structured data, especially across heterogeneous graph tasks, remains an open challenge. To address this, we propose a novel approach to cross-task generalization in graphs via task-trees, which serve as unified learning instances aligning node-, edge-, and graph-level tasks. We theoretically analyze the stability, transferability, and generalization properties of task-trees, showing that pretraining a graph neural network (GNN) on diverse task-trees with a reconstruction objective induces transferable knowledge. This enables efficient adaptation to downstream tasks with minimal fine-tuning. To validate our framework, we introduce Graph Generality Identifier on Task-Trees (GIT), a graph foundation model that demonstrates strong performance on over 30 graphs across five domains via fine-tuning, in-context learning, and zero-shot generalization. Code and data are available at https://github.com/Zehong-Wang/GIT.
