Sheaf Graph Neural Networks via PAC-Bayes Spectral Optimization
Yoonhyuk Choi, Jiho Choi, Chong-Kwon Kim
TL;DR
SGPC introduces a unified framework for learning cellular-sheaf GNNs with PAC-Bayes calibration, combining a Wasserstein-Entropic Lift for restriction maps, SVR diffusion, and adaptive frequency mixing to robustly handle heterophily. The approach yields a spectrum-aware, bound-guided objective that comes with convergence guarantees, monotone spectral-gap growth, and risk-variance contraction, enabling end-to-end training with linear-time complexity. Theoretical contributions establish a tight PAC-Bayes generalization bound for cellular-sheaf GNNs and quantify diffusion stability via the spectral gap. Empirically, SGPC achieves state-of-the-art results across nine benchmarks with calibrated uncertainty intervals, demonstrating strong robustness to over-smoothing and heterophily while remaining scalable to large graphs.
Abstract
Over-smoothing in Graph Neural Networks (GNNs) causes collapse in distinct node features, particularly on heterophilic graphs where adjacent nodes often have dissimilar labels. Although sheaf neural networks partially mitigate this problem, they typically rely on static or heavily parameterized sheaf structures that hinder generalization and scalability. Existing sheaf-based models either predefine restriction maps or introduce excessive complexity, yet fail to provide rigorous stability guarantees. In this paper, we introduce a novel scheme called SGPC (Sheaf GNNs with PAC-Bayes Calibration), a unified architecture that combines cellular-sheaf message passing with several mechanisms, including optimal transport-based lifting, variance-reduced diffusion, and PAC-Bayes spectral regularization for robust semi-supervised node classification. We establish performance bounds theoretically and demonstrate that end-to-end training in linear computational complexity can achieve the resulting bound-aware objective. Experiments on nine homophilic and heterophilic benchmarks show that SGPC outperforms state-of-the-art spectral and sheaf-based GNNs while providing certified confidence intervals on unseen nodes. The code and proofs are in https://github.com/ChoiYoonHyuk/SGPC.
