WL Tests Are Far from All We Need: Revisiting WL-Test Hardness and GNN Expressive Power from a Distributed Computation Perspective
Guanyu Cui, Yuhe Guo, Zhewei Wei, Hsin-Hao Su
TL;DR
This work questions the WL-based lens on GNN expressivity by adopting a distributed-computation perspective. It delivers near-tight bounds on the depth $d$, width $w$, and precision $p$ required to simulate a single WL iteration under deterministic, Las Vegas, and bounded-error randomized regimes, demonstrating that WL iterations are not always primitive for constant-depth GNNs. The authors introduce a generalized Resource-Limited CONGEST (RL-CONGEST) framework with an explicit preprocessing phase to standardize expressivity analyses and quantify how preprocessing choices, including virtual nodes and edges, influence WL behavior. They further show that certain preprocessing strategies can inject implicit shortcuts, while other configurations can meaningfully alter the depth-width-precision requirements, offering nuanced guidance for studying GNN expressivity beyond WL equivalence and for designing GNN architectures under resource constraints.
Abstract
The expressive power of graph neural networks (GNNs) is often studied through their relationship to the Weisfeiler-Lehman (WL) tests. Despite its influence, this perspective leaves two gaps: (i) it is unclear whether WL tests are sufficiently primitive for understanding GNN expressivity, and (ii) WL-induced equivalence does not align well with characterizing the function classes that GNNs can approximate or compute. We attempt to address both gaps. First, we strengthen hardness results for the vanilla WL test, showing that in many settings it is not primitive enough to be implemented by constant-depth GNNs. Second, we propose an alternative framework for studying GNN expressivity based on an extended CONGEST model with an explicit preprocessing phase. Within this framework, we identify implicit shortcuts introduced in prior analyses and establish further results for WL tests in settings where graphs are augmented with virtual nodes and virtual edges.
