Generalizing Weisfeiler-Lehman Kernels to Subgraphs
Dongkwan Kim, Alice Oh
TL;DR
This work tackles subgraph-level representation learning by addressing the expressiveness gaps of traditional GNNs. It introduces WLKS, a Weisfeiler-Lehman kernel generalized to subgraphs via induced $k$-hop neighborhoods, and merges information across $k$-hops, notably using $k\in\{0,D\}$ to balance expressiveness and efficiency. The authors prove that multi-$k$ combinations can be more expressive than any single $k$, and demonstrate that the kernel is PSD. Empirically, WLKS outperforms several state-of-the-art baselines on five of eight datasets while offering substantial training-time advantages and avoiding GPUs or pretraining, with extensions to continuous features and potential GNN integrations. This provides a practical, scalable alternative to deep GNNs for subgraph-level tasks, with code available for community use.
Abstract
Subgraph representation learning has been effective in solving various real-world problems. However, current graph neural networks (GNNs) produce suboptimal results for subgraph-level tasks due to their inability to capture complex interactions within and between subgraphs. To provide a more expressive and efficient alternative, we propose WLKS, a Weisfeiler-Lehman (WL) kernel generalized for subgraphs by applying the WL algorithm on induced $k$-hop neighborhoods. We combine kernels across different $k$-hop levels to capture richer structural information that is not fully encoded in existing models. Our approach can balance expressiveness and efficiency by eliminating the need for neighborhood sampling. In experiments on eight real-world and synthetic benchmarks, WLKS significantly outperforms leading approaches on five datasets while reducing training time, ranging from 0.01x to 0.25x compared to the state-of-the-art.
