HoGA: Higher-Order Graph Attention via Diversity-Aware k-Hop Sampling
Thomas Bailie, Yun Sing Koh, Karthik Mukkavilli
TL;DR
Edge-based MPNNs face limited expressivity for higher-order topology. HoGA introduces a diversity-aware $k$-hop sampling module that builds $K$-hop attention by walking on the $k$-order line graph $L_k(G)$ and aggregating via $oldsymbol{A}_{1:K}(oldsymbol{x}(t))= extstyle\sum_{1\le k\le K}\beta(k)oldsymbol{A}_k(oldsymbol{x}(t),oldsymbol{S}_k)$, with a history-buffer guided heuristic to maximize feature diversity. The sampling strategy reduces redundancy and oversquashing while remaining tractable under a budget $ig|Eig|$, and HoGA can be plugged into existing single-hop models such as GAT and GRAND to yield HoGA-GAT and HoGA-GRAND. Empirically, HoGA improves node classification accuracy across both homophilic and heterophilic benchmarks, often outperforming recent higher-order methods, while maintaining reasonable runtime and memory requirements. This approach provides a scalable pathway to leverage richer topological signals in graphs without prohibitive state-space growth.
Abstract
Graphs model latent variable relationships in many real-world systems, and Message Passing Neural Networks (MPNNs) are widely used to learn such structures for downstream tasks. While edge-based MPNNs effectively capture local interactions, their expressive power is theoretically bounded, limiting the discovery of higher-order relationships. We introduce the Higher-Order Graph Attention (HoGA) module, which constructs a k-order attention matrix by sampling subgraphs to maximize diversity among feature vectors. Unlike existing higher-order attention methods that greedily resample similar k-order relationships, HoGA targets diverse modalities in higher-order topology, reducing redundancy and expanding the range of captured substructures. Applied to two single-hop attention models, HoGA achieves at least a 5% accuracy gain on all benchmark node classification datasets and outperforms recent baselines on six of eight datasets. Code is available at https://github.com/TB862/Higher_Order.
