Generalization of Graph Neural Networks through the Lens of Homomorphism
Shouheng Li, Dongwoo Kim, Qing Wang
TL;DR
The paper tackles the challenge of understanding GNN generalization by introducing an entropy-based perspective on graph homomorphisms. By linking homomorphism counts to information-theoretic measures and incorporating a Wasserstein-margin viewpoint, it derives non-vacuous, data-dependent bounds that apply across $1$-WL, $k$-WL, HI-GNN, and SI-GNN variants within a unified $\\mathcal{F}$-MPNN framework. The approach yields graph- and node-level generalization bounds that reflect pattern- and depth-related effects on GNN expressivity, and the authors validate these bounds against both real and synthetic datasets. Overall, the results provide a principled, structure-aware tool for predicting and understanding generalization gaps in GNNs, with practical implications for pattern selection and model design.
Abstract
Despite the celebrated popularity of Graph Neural Networks (GNNs) across numerous applications, the ability of GNNs to generalize remains less explored. In this work, we propose to study the generalization of GNNs through a novel perspective - analyzing the entropy of graph homomorphism. By linking graph homomorphism with information-theoretic measures, we derive generalization bounds for both graph and node classifications. These bounds are capable of capturing subtleties inherent in various graph structures, including but not limited to paths, cycles and cliques. This enables a data-dependent generalization analysis with robust theoretical guarantees. To shed light on the generality of of our proposed bounds, we present a unifying framework that can characterize a broad spectrum of GNN models through the lens of graph homomorphism. We validate the practical applicability of our theoretical findings by showing the alignment between the proposed bounds and the empirically observed generalization gaps over both real-world and synthetic datasets.
