Demystifying MPNNs: Message Passing as Merely Efficient Matrix Multiplication
Qin Jiang, Chengjia Wang, Michael Lones, Wei Pang
TL;DR
This work reframes MPNNs as memory-efficient implementations of matrix multiplication, showing that a $k$-layer GNN effectively aggregates from the $k$-hop neighborhood and is approximately equivalent to a single-layer model on $A^k$. It analyzes how different loop structures (self-loops, two-node loops, multi-node loops) alter $k$-hop connectivity and how this density impacts learning, challenging the notion that deeper GNNs fail solely due to over-smoothing. The authors reveal a structure–feature dichotomy: many datasets are, in effect, structure-only when features are uniform, with node degrees acting as the embedded feature; normalization schemes drastically influence information propagation and model behavior. They further argue that gradient-related issues, not just over-smoothing, largely explain performance degradation in sparse graphs, offering practical guidance on directed vs. undirected aggregation and normalization choices for robust GNN design and deployment.
Abstract
While Graph Neural Networks (GNNs) have achieved remarkable success, their design largely relies on empirical intuition rather than theoretical understanding. In this paper, we present a comprehensive analysis of GNN behavior through three fundamental aspects: (1) we establish that \textbf{$k$-layer} Message Passing Neural Networks efficiently aggregate \textbf{$k$-hop} neighborhood information through iterative computation, (2) analyze how different loop structures influence neighborhood computation, and (3) examine behavior across structure-feature hybrid and structure-only tasks. For deeper GNNs, we demonstrate that gradient-related issues, rather than just over-smoothing, can significantly impact performance in sparse graphs. We also analyze how different normalization schemes affect model performance and how GNNs make predictions with uniform node features, providing a theoretical framework that bridges the gap between empirical success and theoretical understanding.
