Survey on Generalization Theory for Graph Neural Networks
Antonis Vasileiou, Stefanie Jegelka, Ron Levie, Christopher Morris
TL;DR
This survey addresses the gap in the theory of how and when message-passing graph neural networks (MPNNs) generalize beyond their training graphs. It synthesizes bounds from multiple formalisms (VC dimension, Rademacher complexity, covering numbers, stability, PAC-Bayes, graphon theory, and transductive and out-of-distribution (OOD) analyses) to provide a unified view of MPNN generalization across graph- and node-level tasks. Key insights include the connection between $1$-WL expressivity and VC bounds, the tighter guarantees offered by data-dependent Rademacher bounds for MPNNs, and graphon- and diffusion-matrix-based perspectives that yield distribution-aware guarantees. The work highlights open problems and future directions, such as leveraging graph-specific structure to tighten bounds, extending beyond $1$-WL expressivity, and developing practical, informative bounds for real-world graph distributions that account for OOD and size-transfer settings.
Abstract
Message-passing graph neural networks (MPNNs) have emerged as the leading approach for machine learning on graphs, attracting significant attention in recent years. While a large body of work has explored the expressivity of MPNNs, i.e., their ability to separate graphs and approximate functions over them, comparatively little attention has been directed toward their generalization abilities, i.e., their ability to make meaningful predictions beyond the training data. Here, we systematically review the existing literature on the generalization abilities of MPNNs. We analyze the strengths and limitations of the surveyed studies, providing insights into their methodologies and findings. Furthermore, we identify potential avenues for future research, aiming to deepen our understanding of the generalization abilities of MPNNs.
