Non-geodesically-convex optimization in the Wasserstein space
Hoang Phuc Hau Luu, Hanlin Yu, Bernardo Williams, Petrus Mikkola, Marcelo Hartmann, Kai Puolamäki, Arto Klami
TL;DR
This work studies optimization over the Wasserstein space $\mathcal{P}_2(\mathbb{R}^d)$ for a non geodesically convex objective $\mathcal{F}(\mu)=\mathcal{E}_{F}(\mu)+\mathscr{H}(\mu)$ with $F=G-H$ DC and $\mathscr{H}$ convex along generalized geodesics. It introduces a semi Forward-Backward Euler scheme that alternates a forward step on the DC concave part and a backward JKO step on the convex part, exploiting Brenier maps to obtain convergence guarantees even when $F$ is nonconvex. The authors establish asymptotic and nonasymptotic rates for the Wasserstein gradient mapping and Fréchet subdifferentials, and show global convergence under a Łojasiewicz-type inequality with explicit rates in three regimes of the exponent $\theta$, plus KL-based convergence when $\mathscr{H}$ is the negative entropy. They also provide practical transport-map based implementations using input convex neural networks and illustrate the approach on nonconvex sampling tasks such as Gaussian mixtures and distance-to-set priors.
Abstract
We study a class of optimization problems in the Wasserstein space (the space of probability measures) where the objective function is nonconvex along generalized geodesics. Specifically, the objective exhibits some difference-of-convex structure along these geodesics. The setting also encompasses sampling problems where the logarithm of the target distribution is difference-of-convex. We derive multiple convergence insights for a novel semi Forward-Backward Euler scheme under several nonconvex (and possibly nonsmooth) regimes. Notably, the semi Forward-Backward Euler is just a slight modification of the Forward-Backward Euler whose convergence is -- to our knowledge -- still unknown in our very general non-geodesically-convex setting.
