Improving the Expressiveness of $K$-hop Message-Passing GNNs by Injecting Contextualized Substructure Information
Tianjun Yao, Yiongxu Wang, Kun Zhang, Shangsong Liang
TL;DR
This work identifies a fundamental expressivity limit of standard MPGNNs, bounded by the $1$-WL test, and shows that $K$-hop message-passing struggles to capture internal substructure within $K$-hop ego-nets. It introduces a substructure encoding function $f$ and contextualized substructure information, culminating in the SEK-1-WL color refinement and SEK-GNN, which are provably more powerful than $K$-hop $1$-WL and Subgraph $1$-WL and competitive with or surpassing $3$-WL. The encoding relies on an efficient random-walk-based feature set, including self-return probabilities and related landing probabilities, enabling scalable, parallelizable computation. Empirically, SEK-GNN achieves state-of-the-art or competitive results on synthetic benchmarks, graph classification suites, and QM9 molecular properties, while maintaining lower space complexity than many subgraph-based methods. The work therefore provides a practical, theoretically grounded path to more expressive GNNs without incurring prohibitive costs, with potential extensions to other subgraph-based architectures and encodings.
Abstract
Graph neural networks (GNNs) have become the \textit{de facto} standard for representational learning in graphs, and have achieved state-of-the-art performance in many graph-related tasks; however, it has been shown that the expressive power of standard GNNs are equivalent maximally to 1-dimensional Weisfeiler-Lehman (1-WL) Test. Recently, there is a line of works aiming to enhance the expressive power of graph neural networks. One line of such works aim at developing $K$-hop message-passing GNNs where node representation is updated by aggregating information from not only direct neighbors but all neighbors within $K$-hop of the node. Another line of works leverages subgraph information to enhance the expressive power which is proven to be strictly more powerful than 1-WL test. In this work, we discuss the limitation of $K$-hop message-passing GNNs and propose \textit{substructure encoding function} to uplift the expressive power of any $K$-hop message-passing GNN. We further inject contextualized substructure information to enhance the expressiveness of $K$-hop message-passing GNNs. Our method is provably more powerful than previous works on $K$-hop graph neural networks and 1-WL subgraph GNNs, which is a specific type of subgraph based GNN models, and not less powerful than 3-WL. Empirically, our proposed method set new state-of-the-art performance or achieves comparable performance for a variety of datasets. Our code is available at \url{https://github.com/tianyao-aka/Expresive_K_hop_GNNs}.
