Subgraph Pooling: Tackling Negative Transfer on Graphs
Zehong Wang, Zheyuan Zhang, Chuxu Zhang, Yanfang Ye
TL;DR
The paper studies negative transfer in graph neural networks caused by structural differences between semantically similar source and target graphs, which induce distribution shifts in node embeddings. It introduces Subgraph Pooling (SP) and Subgraph Pooling++ (SP++) to transfer subgraph‑level information by sampling $k$‑hop neighborhoods or random walks, thereby reducing the graph discrepancy measured by CMD and improving transfer performance. The authors provide theoretical analysis showing how SP reduces discrepancy and demonstrate, across diverse datasets and transfer settings, that SP/SP++ consistently outperform strong baselines with no extra learnable parameters and compatibility with any GNN backbone. The work offers a practical, efficient approach to robust graph transfer learning with broad applicability to evolving graph structures in real‑world tasks, and it is accompanied by public code and data.
Abstract
Transfer learning aims to enhance performance on a target task by using knowledge from related tasks. However, when the source and target tasks are not closely aligned, it can lead to reduced performance, known as negative transfer. Unlike in image or text data, we find that negative transfer could commonly occur in graph-structured data, even when source and target graphs have semantic similarities. Specifically, we identify that structural differences significantly amplify the dissimilarities in the node embeddings across graphs. To mitigate this, we bring a new insight in this paper: for semantically similar graphs, although structural differences lead to significant distribution shift in node embeddings, their impact on subgraph embeddings could be marginal. Building on this insight, we introduce Subgraph Pooling (SP) by aggregating nodes sampled from a k-hop neighborhood and Subgraph Pooling++ (SP++) by a random walk, to mitigate the impact of graph structural differences on knowledge transfer. We theoretically analyze the role of SP in reducing graph discrepancy and conduct extensive experiments to evaluate its superiority under various settings. The proposed SP methods are effective yet elegant, which can be easily applied on top of any backbone Graph Neural Networks (GNNs). Our code and data are available at: https://github.com/Zehong-Wang/Subgraph-Pooling.
