One Model for One Graph: A New Perspective for Pretraining with Cross-domain Graphs
Jingzhe Liu, Haitao Mao, Zhikai Chen, Bingheng Li, Wenqi Fan, Mingxuan Ju, Tong Zhao, Neil Shah, Jiliang Tang
TL;DR
This work tackles cross-domain graph generalization by proposing OMOG, a two-stage framework that trains a separate source model for each pretraining graph and uses a scoring module to select and fuse a subset of experts at inference. By encoding node attributes into a unified text space and applying non-parametric SGC, OMOG mitigates domain heterogeneity and negative transfer, while its fusion mechanism has a Bayesian averaging interpretation and theoretical guarantees. Empirically, OMOG achieves superior zero-shot and few-shot transfer on ten text-attributed graph datasets for node classification and link prediction, outperforming single-backbone and mixture baselines and offering improved efficiency. The approach offers a scalable path toward graph foundation models, enabling targeted cross-graph transfer and easy incorporation of new graphs without retraining the entire bank.
Abstract
Graph Neural Networks (GNNs) have emerged as a powerful tool to capture intricate network patterns, achieving success across different domains. However, existing GNNs require careful domain-specific architecture designs and training from scratch on each dataset, leading to an expertise-intensive process with difficulty in generalizing across graphs from different domains. Therefore, it can be hard for practitioners to infer which GNN model can generalize well to graphs from their domains. To address this challenge, we propose a novel cross-domain pretraining framework, "one model for one graph," which overcomes the limitations of previous approaches that failed to use a single GNN to capture diverse graph patterns across domains with significant gaps. Specifically, we pretrain a bank of expert models, with each one corresponding to a specific dataset. When inferring to a new graph, gating functions choose a subset of experts to effectively integrate prior model knowledge while avoiding negative transfer. Extensive experiments consistently demonstrate the superiority of our proposed method on both link prediction and node classification tasks.
