Graph Learning with Distributional Edge Layouts
Xinjian Zhao, Chaolong Ying, Tianshu Yu
TL;DR
This work introduces Distributional Edge Layouts (DELs), a physics-inspired pre-processing technique for graph neural networks that samples a distribution of edge layouts from a Boltzmann energy surface using Langevin dynamics: ${\mathbb{P}}(\mathbf{C})=(1/Z) e^{-E(\mathbf{C})/\alpha}$. By converting steady-state layouts into edge features and embedding them into GNNs through edge-aware mechanisms, DEL captures a broader energy landscape and provides additional expressivity beyond the 1-WL test, while remaining architecture-agnostic. Empirically, DEL-enhanced backbones (GAT, Graph Transformer, GPS) achieve substantial improvements across six datasets, with analyses showing stable isomorphic layouts and discriminative power for certain non-isomorphic graphs. The approach is scalable as a pre-processing step and opens the door to leveraging global layout distributions to augment a wide range of GNN models in practice.
Abstract
Graph Neural Networks (GNNs) learn from graph-structured data by passing local messages between neighboring nodes along edges on certain topological layouts. Typically, these topological layouts in modern GNNs are deterministically computed (e.g., attention-based GNNs) or locally sampled (e.g., GraphSage) under heuristic assumptions. In this paper, we for the first time pose that these layouts can be globally sampled via Langevin dynamics following Boltzmann distribution equipped with explicit physical energy, leading to higher feasibility in the physical world. We argue that such a collection of sampled/optimized layouts can capture the wide energy distribution and bring extra expressivity on top of WL-test, therefore easing downstream tasks. As such, we propose Distributional Edge Layouts (DELs) to serve as a complement to a variety of GNNs. DEL is a pre-processing strategy independent of subsequent GNN variants, thus being highly flexible. Experimental results demonstrate that DELs consistently and substantially improve a series of GNN baselines, achieving state-of-the-art performance on multiple datasets.
