Unveiling Mode Connectivity in Graph Neural Networks
Bingheng Li, Zhikai Chen, Haoyu Han, Shenglai Zeng, Jingzhe Liu, Jiliang Tang
TL;DR
This work investigates mode connectivity in Graph Neural Networks (GNNs), revealing a distinct non-linear connectivity pattern largely governed by graph structure rather than architectural choices. By analyzing 12 diverse graphs and employing linear and quadratic Bézier interpolations between independently trained minima, the authors show that minima are often connected through low-barrier nonlinear paths, with the barrier magnitude correlating with graph properties such as density, homophily, and feature separability. A theoretical bound ties the loss barrier to the spectral gap of the graph, providing principled explanations for when connectivity is strong and how it impacts generalization. The paper further introduces a Wasserstein-distance based metric on mode-connectivity curves to quantify cross-domain graph similarity and bound transferability gaps, offering a practical diagnostic for domain alignment and graph-based knowledge transfer. Overall, the results bridge optimization geometry with graph structure, informing better GNN training and transfer strategies across domains.
Abstract
A fundamental challenge in understanding graph neural networks (GNNs) lies in characterizing their optimization dynamics and loss landscape geometry, critical for improving interpretability and robustness. While mode connectivity, a lens for analyzing geometric properties of loss landscapes has proven insightful for other deep learning architectures, its implications for GNNs remain unexplored. This work presents the first investigation of mode connectivity in GNNs. We uncover that GNNs exhibit distinct non-linear mode connectivity, diverging from patterns observed in fully-connected networks or CNNs. Crucially, we demonstrate that graph structure, rather than model architecture, dominates this behavior, with graph properties like homophily correlating with mode connectivity patterns. We further establish a link between mode connectivity and generalization, proposing a generalization bound based on loss barriers and revealing its utility as a diagnostic tool. Our findings further bridge theoretical insights with practical implications: they rationalize domain alignment strategies in graph learning and provide a foundation for refining GNN training paradigms.
