Interpolating between Optimal Transport and KL regularized Optimal Transport using Rényi Divergences
Jonas Bresch, Viktor Stein
TL;DR
This work introduces a Rényi-divergence-based regularization of optimal transport to interpolate between the classical unregularized OT and KL-regularized OT. It develops two reformulations: a premetric defined by Rényi-ball constraints and a penalized objective, and derives a dual framework with explicit primal plans expressed via dual potentials. The authors prove fundamental properties, including existence/uniqueness of minimizers and interpolation theorems governing limits as $α→0,1$ and $ε→0,∞$, along with a nested mirror-descent algorithm for computation. Empirical results on synthetic and real data show that Rényi-regularized OT plans are tighter and closer to unregularized OT than KL or Tsallis counterparts, with improved ground-truth recovery in tasks such as voter migration forecasting.
Abstract
Regularized optimal transport (OT) has received much attention in recent years starting from Cuturi's introduction of Kullback-Leibler (KL) divergence regularized OT. In this paper, we propose regularizing the OT problem using the family of $α$-Rényi divergences for $α\in (0, 1)$. Rényi divergences are neither $f$-divergences nor Bregman distances, but they recover the KL divergence in the limit $α\nearrow 1$. The advantage of introducing the additional parameter $α$ is that for $α\searrow 0$ we obtain convergence to the unregularized OT problem. For the KL regularized OT problem, this was achieved by letting the regularization parameter $\varepsilon$ tend to zero, which causes numerical instabilities. We present two different ways to obtain premetrics on probability measures, namely by Rényi divergence constraints and by penalization. The latter premetric interpolates between the unregularized and the KL regularized OT problem with weak convergence of the unique minimizer, generalizing the interpolation property of KL regularized OT. We use a nested mirror descent algorithm to solve the primal formulation. Both on real and synthetic data sets Rényi regularized OT plans outperform their KL and Tsallis counterparts in terms of being closer to the unregularized transport plans and recovering the ground truth in inference tasks better.
