Homomorphism Counts as Structural Encodings for Graph Learning
Linus Bao, Emily Jin, Michael Bronstein, İsmail İlkan Ceylan, Matthias Lanzinger
TL;DR
The paper addresses the limited expressivity of common graph encodings in Graph Transformers and proposes motif structural encoding (MoSE), a flexible framework that encodes node structure via counts of graph homomorphisms. MoSE is parameterized by a motif family and a node weighting, and it subsumes RWSE as a special case while offering greater expressivity beyond fixed WL levels. The authors provide theoretical results linking MoSE to the Weisfeiler–Lehman hierarchy, demonstrate that MoSE can distinguish graphs that RWSE cannot, and empirically validate substantial performance gains across molecular and image-derived graph tasks, achieving state-of-the-art results on ZINC-12K and strong results on QM9 and synthetic data. The work offers a general, scalable approach to infusing graph structure into Graph Transformers with broad applicability across domains.
Abstract
Graph Transformers are popular neural networks that extend the well-known Transformer architecture to the graph domain. These architectures operate by applying self-attention on graph nodes and incorporating graph structure through the use of positional encodings (e.g., Laplacian positional encoding) or structural encodings (e.g., random-walk structural encoding). The quality of such encodings is critical, since they provide the necessary $\textit{graph inductive biases}$ to condition the model on graph structure. In this work, we propose $\textit{motif structural encoding}$ (MoSE) as a flexible and powerful structural encoding framework based on counting graph homomorphisms. Theoretically, we compare the expressive power of MoSE to random-walk structural encoding and relate both encodings to the expressive power of standard message passing neural networks. Empirically, we observe that MoSE outperforms other well-known positional and structural encodings across a range of architectures, and it achieves state-of-the-art performance on a widely studied molecular property prediction dataset.
