GraphAlign: Pretraining One Graph Neural Network on Multiple Graphs via Feature Alignment
Zhenyu Hou, Haozhan Li, Yukuo Cen, Jie Tang, Yuxiao Dong
TL;DR
GraphAlign tackles the problem of training one graph neural network across heterogeneous graphs with misaligned node features. It introduces a modular feature-alignment pipeline—a language-model feature encoder, per-graph normalization, and a mixture-of-feature-experts—that pretrains a unified GNN within existing graph SSL frameworks and transfers effectively to unseen graphs. Empirical results on OGB and knowledge-graph tasks show improvements over naive joint training and strong cross-domain transfer, including few-shot scenarios, with an in-context inference strategy that requires no task-specific prompts. This approach broadens the practicality of graph pretraining by enabling cross-graph generalization without relying on task labels or extensive prompt engineering.
Abstract
Graph self-supervised learning (SSL) holds considerable promise for mining and learning with graph-structured data. Yet, a significant challenge in graph SSL lies in the feature discrepancy among graphs across different domains. In this work, we aim to pretrain one graph neural network (GNN) on a varied collection of graphs endowed with rich node features and subsequently apply the pretrained GNN to unseen graphs. We present a general GraphAlign method that can be seamlessly integrated into the existing graph SSL framework. To align feature distributions across disparate graphs, GraphAlign designs alignment strategies of feature encoding, normalization, alongside a mixture-of-feature-expert module. Extensive experiments show that GraphAlign empowers existing graph SSL frameworks to pretrain a unified and powerful GNN across multiple graphs, showcasing performance superiority on both in-domain and out-of-domain graphs.
