Generalization Error of Graph Neural Networks in the Mean-field Regime
Gholamali Aminian, Yixuan He, Gesine Reinert, Łukasz Szpruch, Samuel N. Cohen
TL;DR
The work tackles the challenge of understanding generalization in over-parameterized graph neural networks for graph classification. It adopts a mean-field framework and a KL-regularized empirical risk to derive an analytically tractable Gibbs measure for network parameters, delivering two complementary analyses: a functional-derivative bound and a Rademacher-complexity bound, both yielding an $O\left(\frac{1}{n}\right)$ rate in the mean-field limit. The results are specialized to over-parameterized one-hidden-layer GCNs and MPGNNs, with explicit bounds that depend on graph properties and readout/aggregation choices, and show how graph filters influence generalization. Empirical validation on synthetic and real-world graphs supports the theory, offering concrete guidance on network width and readout design while highlighting promising directions for extending the framework to deeper or hypergraph architectures.
Abstract
This work provides a theoretical framework for assessing the generalization error of graph neural networks in the over-parameterized regime, where the number of parameters surpasses the quantity of data points. We explore two widely utilized types of graph neural networks: graph convolutional neural networks and message passing graph neural networks. Prior to this study, existing bounds on the generalization error in the over-parametrized regime were uninformative, limiting our understanding of over-parameterized network performance. Our novel approach involves deriving upper bounds within the mean-field regime for evaluating the generalization error of these graph neural networks. We establish upper bounds with a convergence rate of $O(1/n)$, where $n$ is the number of graph samples. These upper bounds offer a theoretical assurance of the networks' performance on unseen data in the challenging over-parameterized regime and overall contribute to our understanding of their performance.
