What Expressivity Theory Misses: Message Passing Complexity for GNNs
Niklas Kemper, Tom Wollschläger, Stephan Günnemann
TL;DR
The paper argues that expressivity theories based on the WL test are insufficient to explain real-world GNN performance and introduces Message Passing Complexity (MPC), a continuous, task-specific measure derived from a probabilistic variant of WL (lossyWL). MPC quantifies how difficult it is for a given architecture to solve a graph task via message passing, incorporating practical constraints like over-squashing and under-reaching while preserving WL-impossibility results. The authors prove theoretical properties linking MPC to expressivity, refinement, and compositionality, and validate MPC across tasks (retaining information, propagating information, ring detection) on synthetic and real graphs, showing strong alignment with empirical performance. They show that performance improvements often stem from architectural biases that lower task-specific MPC rather than from universal increases in expressivity, suggesting a shift in design focus toward minimizing MPC for domain-specific tasks. These results offer a principled framework to diagnose and guide GNN architecture design in practical settings.
Abstract
Expressivity theory, characterizing which graphs a GNN can distinguish, has become the predominant framework for analyzing GNNs, with new models striving for higher expressivity. However, we argue that this focus is misguided: First, higher expressivity is not necessary for most real-world tasks as these tasks rarely require expressivity beyond the basic WL test. Second, expressivity theory's binary characterization and idealized assumptions fail to reflect GNNs' practical capabilities. To overcome these limitations, we propose Message Passing Complexity (MPC): a continuous measure that quantifies the difficulty for a GNN architecture to solve a given task through message passing. MPC captures practical limitations like over-squashing while preserving the theoretical impossibility results from expressivity theory, effectively narrowing the gap between theory and practice. Through extensive validation on fundamental GNN tasks, we show that MPC's theoretical predictions correlate with empirical performance, successfully explaining architectural successes and failures. Thereby, MPC advances beyond expressivity theory to provide a more powerful and nuanced framework for understanding and improving GNN architectures.
