Gromov-Wasserstein Graph Coarsening
Carlos A. Taveras, Santiago Segarra, César A. Uribe
TL;DR
This work advances graph coarsening by framing it through the Gromov-Wasserstein distance, enabling unaligned graphs of different sizes to be coarsened while preserving global relational structure. It introduces two algorithms: Greedy Pair Coarsening (GPC), which greedily merges node pairs to recover the minimal GW-representative, and KGPC, a scalable variant using a distortion-based, $k$-means clustering approach on a pairwise distortion matrix $H$. The authors provide theoretical guarantees for GPC under separation conditions and demonstrate empirical advantages over baselines in reconstruction distortion and downstream graph classification across multiple datasets. The proposed GW-centric coarsening framework offers a principled, geometry-aware tool for reducing graph size in non-Euclidean data while maintaining essential topology and transport-based alignments.
Abstract
We study the problem of graph coarsening within the Gromov-Wasserstein geometry. Specifically, we propose two algorithms that leverage a novel representation of the distortion induced by merging pairs of nodes. The first method, termed Greedy Pair Coarsening (GPC), iteratively merges pairs of nodes that locally minimize a measure of distortion until the desired size is achieved. The second method, termed $k$-means Greedy Pair Coarsening (KGPC), leverages clustering based on pairwise distortion metrics to directly merge clusters of nodes. We provide conditions guaranteeing optimal coarsening for our methods and validate their performance on six large-scale datasets and a downstream clustering task. Results show that the proposed methods outperform existing approaches on a wide range of parameters and scenarios.
