The Wasserstein gradient flow of the Sinkhorn divergence between Gaussian distributions
Mathis Hardion, Théo Lacombe
TL;DR
This work analyzes the Wasserstein gradient flow of the Sinkhorn divergence $S_\varepsilon(\cdot,\mu_\star)$ when both source and target are Gaussian, establishing existence, Gaussian-invariance, and uniqueness (within a regular measure class) of the flow. It derives explicit mean and covariance dynamics, proving global convergence to the target when the initial covariance is non-singular and detailing limit behavior when singular or commuting covariances arise; in the commuting case, it shows exponential convergence for full-support targets and $O(t^{-1})$ rates when the target lies in a subspace. The results connect closed-form Gaussian Sinkhorn formulas with the Bures–Wasserstein geometry, yielding precise eigenvalue evolutions and energy-dissipation relations, and are complemented by explicit time-discretization schemes and numerical experiments. Collectively, the paper provides a rigorous partial convergence theory for Gaussian Sinkhorn flows and offers practical guidance for simulations and potential extensions to broader settings.
Abstract
We study the Wasserstein gradient flow of the Sinkhorn divergence when both the source and the target are Gaussian distributions. We prove the existence of a flow that stays in the class of Gaussian distributions, and is unique in the larger class of measures with strongly-concave and smooth log-densities. We prove that the flow globally converges toward the target measure when the source's covariance matrix is not singular, and provide counter-examples to global convergence when it is, giving a first answer to an open question raised in [Carlier et al. 2024, \S4.2]. When the covariance matrix of the source distribution commutes with that of the target, we derive more quantitative results that showcase exponential convergence toward the target when the source and the target share their support, but dropping to linear rates (O(t^{-1})) if the target is concentrated on a strict subspace of the source's support.
