Efficient estimation of the modified Gromov-Hausdorff distance between unweighted graphs
Vladyslav Oles, Nathan Lemons, Alexander Panchenko
TL;DR
The paper tackles the computational intractability of the Gromov–Hausdorff distance by focusing on its modified variant $\widehat{d}_{\mathcal{GH}}$ and leveraging curvature sets into a structural decomposition. It develops polynomial-time lower-bound procedures and an algorithm to estimate $\widehat{d}_{\mathcal{GH}}$, with a concrete implementation for metric spaces induced by unweighted graphs in the scikit-tda library. Empirical results on real-world networks (e.g., Enron, LANL, ABIDE I) and synthetic graphs show frequent exact recovery and meaningful distance-based outlier detection, highlighting practical usefulness for graph-shape matching and anomaly analysis. The work delivers a scalable tool for comparing graph shapes and identifying atypical network structures, with potential applications in network science and beyond.
Abstract
Gromov-Hausdorff distances measure shape difference between the objects representable as compact metric spaces, e.g. point clouds, manifolds, or graphs. Computing any Gromov-Hausdorff distance is equivalent to solving an NP-Hard optimization problem, deeming the notion impractical for applications. In this paper we propose polynomial algorithm for estimating the so-called modified Gromov-Hausdorff (mGH) distance, whose topological equivalence with the standard Gromov-Hausdorff (GH) distance was established in Mémoli F, 2012. We implement the algorithm for the case of compact metric spaces induced by unweighted graphs as part of Python library $\verb|scikit-tda|$, and demonstrate its performance on real-world and synthetic networks. The algorithm finds the mGH distances exactly on most graphs with the scale-free property. We use the computed mGH distances to successfully detect outliers in real-world social and computer networks.
