Non-Homophilic Graph Pre-Training and Prompt Learning
Xingtong Yu, Jie Zhang, Yuan Fang, Renhe Jiang
TL;DR
The paper addresses pre-training and prompt learning for non-homophilic graphs, where neighbor-label relationships are mixed and not uniformly aligned. It analyzes the limitations of homophily-based pre-training and introduces ProNoG, which uses a condition-net to generate node-specific prompts derived from each node's non-homophilic neighborhood patterns, enabling fine-grained downstream adaptation while freezing the pre-trained encoder. The approach is supported by theoretical insights on why non-homophily tasks are preferable in non-homophilic settings and extensive experiments on ten public datasets showing ProNoG outperforms strong baselines, including uniform graph prompts. The work advances efficient, node-wise adaptation for few-shot graph tasks and offers practical guidance for choosing pre-training objectives in heterogeneous graph settings.
Abstract
Graphs are ubiquitous for modeling complex relationships between objects across various fields. Graph neural networks (GNNs) have become a mainstream technique for graph-based applications, but their performance heavily relies on abundant labeled data. To reduce labeling requirement, pre-training and prompt learning has become a popular alternative. However, most existing prompt methods do not differentiate homophilic and heterophilic characteristics of real-world graphs. In particular, many real-world graphs are non-homophilic, not strictly or uniformly homophilic with mixing homophilic and heterophilic patterns, exhibiting varying non-homophilic characteristics across graphs and nodes. In this paper, we propose ProNoG, a novel pre-training and prompt learning framework for such non-homophilic graphs. First, we analyze existing graph pre-training methods, providing theoretical insights into the choice of pre-training tasks. Second, recognizing that each node exhibits unique non-homophilic characteristics, we propose a conditional network to characterize the node-specific patterns in downstream tasks. Finally, we thoroughly evaluate and analyze ProNoG through extensive experiments on ten public datasets.
