Homomorphism distortion: A metric to distinguish them all and in the latent space bind them
Martin Carrasco, Olga Zaghen, Erik Bekkers, Bastian Rieck
TL;DR
This work introduces graph homomorphism distortion (HD), a continuous, permutation-invariant distance on vertex-attributed graphs that addresses the binary expressivity limitations of WL tests and standard GNNs. The authors formalize d_{HD} as a (pseudo)-metric based on maximal attribute deviations under homomorphisms between graphs, and show that, under mild assumptions, it yields a complete graph embedding and can be approximated via sampling with an expectation-complete formulation. They prove theoretical properties and demonstrate empirical effectiveness, including full distinction of graphs in the BRE C benchmark and superior performance over prior homomorphism- and subgraph-count-based methods on ZINC-12k in learning setups. By enabling a continuous, characteristic-based representation of graphs, the approach offers a principled path toward richer graph characterizations and practical tools for graph representation learning.
Abstract
For far too long, expressivity of graph neural networks has been measured \emph{only} in terms of combinatorial properties. In this work we stray away from this tradition and provide a principled way to measure similarity between vertex attributed graphs. We denote this measure as the \emph{graph homomorphism distortion}. We show it can \emph{completely characterize} graphs and thus is also a \emph{complete graph embedding}. However, somewhere along the road, we run into the graph canonization problem. To circumvent this obstacle, we devise to efficiently compute this measure via sampling, which in expectation ensures \emph{completeness}. Additionally, we also discovered that we can obtain a metric from this measure. We validate our claims empirically and find that the \emph{graph homomorphism distortion}: (1.) fully distinguishes the \texttt{BREC} dataset with up to $4$-WL non-distinguishable graphs, and (2.) \emph{outperforms} previous methods inspired in homomorphisms under the \texttt{ZINC-12k} dataset. These theoretical results, (and their empirical validation), pave the way for future characterization of graphs, extending the graph theoretic tradition to new frontiers.
