A semiconcavity approach to stability of entropic plans and exponential convergence of Sinkhorn's algorithm
Alberto Chiarini, Giovanni Conforti, Giacomo Greco, Luca Tamanini
TL;DR
The paper develops a semiconcavity-based framework for stability of entropic OT plans and for exponential convergence of Sinkhorn's algorithm in unbounded settings. By showing that a $\Lambda$-semiconcavity bound on the entropic potential $c(x,\cdot)+\psi_ε^{\nu}(\cdot)$ yields a KL stability bound $\mathrm{KL}(\pi_ε^{\rho\mu}|\pi_ε^{\rho\nu}) \le \mathrm{KL}(\mu|\nu) + \frac{\Lambda}{2ε}\mathbf{W}_2^2(\mu,\nu)$, the authors couple this with Talagrand-type transport inequalities to obtain geometric convergence of Sinkhorn iterates. The approach extends to $(\Lambda,\omega)$-semiconcavity and generalized TI, enabling exponential convergence across a wide range of marginals and costs, including log-concave, weakly log-concave, light-tailed, anisotropic quadratic, Lipschitz, and manifold settings, often with rates linear in the regularization parameter $ε$. The results sharpen previous bounds by relaxing Hessian-boundedness requirements and providing explicit, sometimes sharp, dependence on problem data, tail behavior, and geometric structure. The work thus broadens the applicability of entropic OT in unbounded domains and informs practical performance of Sinkhorn-type algorithms in diverse contexts. Overall, the paper advances both theory and potential applications of entropic OT by linking semiconcavity propagation to stability and fast convergence.
Abstract
We study stability of optimizers and convergence of Sinkhorn's algorithm for the entropic optimal transport problem. In the special case of the quadratic cost, our stability bounds imply that if one of the two entropic potentials is semiconcave, then the relative entropy between optimal plans is controlled by the squared Wasserstein distance between their marginals. When employed in the analysis of Sinkhorn's algorithm, this result gives a natural sufficient condition for its exponential convergence, which does not require the ground cost to be bounded. By controlling from above the Hessians of Sinkhorn potentials in examples of interest, we obtain new exponential convergence results. For instance, for the first time we obtain exponential convergence for log-concave marginals and quadratic costs for all values of the regularization parameter, based on semiconcavity propagation results. Moreover, the convergence rate has a linear dependence on the regularization: this behavior is sharp and had only been previously obtained for compact distributions arXiv:2407.01202. These optimal rates are also established in situations where one of the two marginals does not have subgaussian tails. Other interesting new applications include subspace elastic costs, weakly log-concave marginals, marginals with light tails (where, under reinforced assumptions, we manage to improve the rates obtained in arXiv:2311.04041), the case of Lipschitz costs with bounded Hessian, and compact Riemannian manifolds.
