Gromov-Wasserstein-like Distances in the Gaussian Mixture Models Space
Antoine Salmona, Julie Delon, Agnès Desolneux
TL;DR
This work introduces two isometry-invariant, Gromov-Wasserstein-type OT distances for Gaussian mixtures: MGW2, a Gromovization of the Mixture-Wasserstein distance, and EW2 (and its GMM specialization MEW2) to obtain explicit transport plans between GMMs. MGW2 reduces the GW problem to a small-scale discrete coupling over components, enabling efficient distance computation across different dimensions, while EW2 provides a structured way to derive transport maps by optimizing an isometric transformation. The authors propose annealing strategies and plan-design tricks to mitigate nonconvexity, and demonstrate competitive performance in shape matching and hyperspectral color transfer on medium-to-large scale problems, highlighting practical advantages over traditional GW solvers. The results suggest that MGW2 and EW2 offer scalable, invariance-aware tools for comparing clustered distributions and extracting correspondences in high-dimensional spaces. The work highlights practical impact for tasks with intrinsic clustering, cross-domain comparisons, and applications requiring robust mapping between heterogeneous data shapes.
Abstract
The Gromov-Wasserstein (GW) distance is frequently used in machine learning to compare distributions across distinct metric spaces. Despite its utility, it remains computationally intensive, especially for large-scale problems. Recently, a novel Wasserstein distance specifically tailored for Gaussian mixture models (GMMs) and known as MW2 (mixture Wasserstein) has been introduced by several authors. In scenarios where data exhibit clustering, this approach simplifies to a small-scale discrete optimal transport problem, which complexity depends solely on the number of Gaussian components in the GMMs. This paper aims to incorporate invariance properties into MW2. This is done by introducing new Gromov-type distances, designed to be isometry-invariant in Euclidean spaces and applicable for comparing GMMs across different dimensional spaces. Our first contribution is the Mixture Gromov Wasserstein distance (MGW2), which can be viewed as a "Gromovized" version of MW2. This new distance has a straightforward discrete formulation, making it highly efficient for estimating distances between GMMs in practical applications. To facilitate the derivation of a transport plan between GMMs, we present a second distance, the Embedded Wasserstein distance (EW2). This distance turns out to be closely related to several recent alternatives to Gromov-Wasserstein. We show that EW2 can be adapted to derive a distance as well as optimal transportation plans between GMMs. We demonstrate the efficiency of these newly proposed distances on medium to large-scale problems, including shape matching and hyperspectral image color transfer.
