Cayley Graph Propagation
JJ Wilson, Maya Bechler-Speicher, Petar Veličković
TL;DR
The paper addresses over-squashing in graph neural networks by adopting a complete Cayley graph based propagation scheme, CGP, which interleaves message passing on the input graph with global communication on the complete Cayley graph of SL(2, Z_n). By retaining all Cayley nodes (including virtual ones) and avoiding truncation, CGP achieves bottleneck-free information flow, evidenced by improved expansion properties and reduced diameters. Empirical results across OGB, TUDataset, and Long Range Graph Benchmark show CGP outperforms Expander Graph Propagation and several graph-rewiring baselines while maintaining favorable runtime and scalability. The work demonstrates that a theoretically grounded Cayley-graph template can provide practical gains for long-range information integration in real-world graph tasks, with future directions including task-aligned Cayley edges and temporal graph settings.
Abstract
In spite of the plethora of success stories with graph neural networks (GNNs) on modelling graph-structured data, they are notoriously vulnerable to over-squashing, whereby tasks necessitate the mixing of information between distance pairs of nodes. To address this problem, prior work suggests rewiring the graph structure to improve information flow. Alternatively, a significant body of research has dedicated itself to discovering and precomputing bottleneck-free graph structures to ameliorate over-squashing. One well regarded family of bottleneck-free graphs within the mathematical community are expander graphs, with prior work -- Expander Graph Propagation (EGP) -- proposing the use of a well-known expander graph family -- the Cayley graphs of the $\mathrm{SL}(2,\mathbb{Z}_n)$ special linear group -- as a computational template for GNNs. However, in EGP the computational graphs used are truncated to align with a given input graph. In this work, we show that truncation is detrimental to the coveted expansion properties. Instead, we propose CGP, a method to propagate information over a complete Cayley graph structure, thereby ensuring it is bottleneck-free to better alleviate over-squashing. Our empirical evidence across several real-world datasets not only shows that CGP recovers significant improvements as compared to EGP, but it is also akin to or outperforms computationally complex graph rewiring techniques.
