On sparsity, extremal structure, and monotonicity properties of Wasserstein and Gromov-Wasserstein optimal transport plans
Titouan Vayer
TL;DR
The paper investigates how Gromov-Wasserstein (GW) optimal plans relate to classical linear OT properties. By introducing a conditionally negative semi-definite (CND) tensor framework for the GW loss, it shows that concavity of the GW objective on the coupling polytope implies the existence of sparse, extreme-point optima and tight coupling relaxations, mirroring Monge–Kantorovich equivalence under suitable conditions. It provides a detailed characterization for separable losses and gives concrete examples, notably the squared-distance and KL-divergence cases, where the CND conditions hold via Schoenberg-type results and infinite divisibility, respectively. The work also discusses a monotonicity-type property for GW arising from a linearization at the GW optimum and clarifies the limits of these properties beyond the CND regime. Overall, CND energies offer a practical lens to understand and exploit GW structure in algorithms, while noting that such properties are not universal but are commonly encountered in practice.
Abstract
This note gives a self-contained overview of some important properties of the Gromov-Wasserstein (GW) distance, compared with the standard linear optimal transport (OT) framework. More specifically, I explore the following questions: are GW optimal transport plans sparse? Under what conditions are they supported on a permutation? Do they satisfy a form of cyclical monotonicity? In particular, I present the conditionally negative semi-definite property and show that, when it holds, there are GW optimal plans that are sparse and supported on a permutation.
