Gromov-Wasserstein at Scale, Beyond Squared Norms
Guillaume Houry, Jean Feydy, François-Xavier Vialard
TL;DR
This work tackles the scalability and rotation-sensitivity of Gromov-Wasserstein matching by introducing CNT costs, which admit a lifted feature-space embedding making GW behave like a linear-algebraic alignment problem. The authors derive a dual formulation showing GW_{ abla} with CNT costs reduces to a coupled optimization over a linear map Γ and an optimal transport plan in lifted spaces, enabling an alternating minimization that is memory-efficient and time-feasible for large point sets. They establish entropic debiasing and convergence guarantees for CNT-EGW, and present practical solvers (CNT-GW, Kernel-GW, and Multiscale-GW MsGW) with linear memory and quadratic time complexity, scalable to hundreds of thousands of points. Empirically, CNT-based methods outperform state-of-the-art GW solvers by large margins, enable GW barycenters and landscape visualization, and reach speeds sufficient for near-real-time registration on datasets with up to ~177k points, highlighting broad applicability to high-resolution geometric tasks.
Abstract
A fundamental challenge in data science is to match disparate point sets with each other. While optimal transport efficiently minimizes point displacements under a bijectivity constraint, it is inherently sensitive to rotations. Conversely, minimizing distortions via the Gromov-Wasserstein (GW) framework addresses this limitation but introduces a non-convex, computationally demanding optimization problem. In this work, we identify a broad class of distortion penalties that reduce to a simple alignment problem within a lifted feature space. Leveraging this insight, we introduce an iterative GW solver with a linear memory footprint and quadratic (rather than cubic) time complexity. Our method is differentiable, comes with strong theoretical guarantees, and scales to hundreds of thousands of points in minutes. This efficiency unlocks a wide range of geometric applications and enables the exploration of the GW energy landscape, whose local minima encode the symmetries of the matching problem.
