Uncovering Challenges of Solving the Continuous Gromov-Wasserstein Problem
Xavier Aramayo Carrasco, Maksim Nekrashevich, Petr Mokrov, Evgeny Burnaev, Alexander Korotin
TL;DR
This work addresses the challenging problem of continuous Gromov-Wasserstein OT, which seeks a parametric map $T^*:\mathbb{R}^{d_x}\to\mathbb{R}^{d_y}$ between unknown distributions from samples. It provides formal background on OT and GWOT, highlights that existing solvers largely rely on discrete approximations, and shows their performance deteriorates when source and target data are uncorrelated. The authors introduce NeuralGW, a neural minimax solver that does not depend on discrete GW and can scale to large datasets, offering improved performance on uncorrelated data while facing stability and initialization challenges. Overall, the paper demonstrates that data correlation heavily governs GWOT success and calls for developing reliable, general-purpose continuous GWOT solvers with practical applicability. Key contributions include a minimax reformulation of innerGW, a scalable neural architecture for $f$ and $T$, and extensive large-scale benchmarks revealing both strengths and limitations of current approaches.
Abstract
Recently, the Gromov-Wasserstein Optimal Transport (GWOT) problem has attracted the special attention of the ML community. In this problem, given two distributions supported on two (possibly different) spaces, one has to find the most isometric map between them. In the discrete variant of GWOT, the task is to learn an assignment between given discrete sets of points. In the more advanced continuous formulation, one aims at recovering a parametric mapping between unknown continuous distributions based on i.i.d. samples derived from them. The clear geometrical intuition behind the GWOT makes it a natural choice for several practical use cases, giving rise to a number of proposed solvers. Some of them claim to solve the continuous version of the problem. At the same time, GWOT is notoriously hard, both theoretically and numerically. Moreover, all existing continuous GWOT solvers still heavily rely on discrete techniques. Natural questions arise: to what extent do existing methods unravel the GWOT problem, what difficulties do they encounter, and under which conditions they are successful? Our benchmark paper is an attempt to answer these questions. We specifically focus on the continuous GWOT as the most interesting and debatable setup. We crash-test existing continuous GWOT approaches on different scenarios, carefully record and analyze the obtained results, and identify issues. Our findings experimentally testify that the scientific community is still missing a reliable continuous GWOT solver, which necessitates further research efforts. As the first step in this direction, we propose a new continuous GWOT method which does not rely on discrete techniques and partially solves some of the problems of the competitors.
