Greener GRASS: Enhancing GNNs with Encoding, Rewiring, and Attention
Tongzhou Liao, Barnabás Póczos
TL;DR
GRASS tackles robust graph learning by uniting three complementary components: random walk-based graph encoding, random rewiring via superimposed random regular graphs, and a graph-tailored additive attention. The RRWP family encodings, including the efficient D-RRWP variant, provide rich structural signals to nodes and edges, while the rewiring increases long-range connectivity and mitigates information bottlenecks; the attention mechanism leverages edge representations to drive selective, directionally aware aggregation. Through extensive benchmarking and ablations, GRASS demonstrates state-of-the-art or competitive performance across standard GNN datasets and long-range graph benchmarks, with notable gains on ZINC (e.g., MAE improvement of $20.3 ext{%}$). The work also discusses its scalability, stochastic outputs, and reproducibility, highlighting practical trade-offs and providing a public codebase for replication and future research. Overall, GRASS offers a cohesive framework that enhances GNNs by synergistically encoding structure, enriching topology, and refining attention to graph-structured data.
Abstract
Graph Neural Networks (GNNs) have become important tools for machine learning on graph-structured data. In this paper, we explore the synergistic combination of graph encoding, graph rewiring, and graph attention, by introducing Graph Attention with Stochastic Structures (GRASS), a novel GNN architecture. GRASS utilizes relative random walk probabilities (RRWP) encoding and a novel decomposed variant (D-RRWP) to efficiently capture structural information. It rewires the input graph by superimposing a random regular graph to enhance long-range information propagation. It also employs a novel additive attention mechanism tailored for graph-structured data. Our empirical evaluations demonstrate that GRASS achieves state-of-the-art performance on multiple benchmark datasets, including a 20.3% reduction in mean absolute error on the ZINC dataset.
