Transport information Bregman divergences
Wuchen Li
TL;DR
This work extends the concept of Bregman divergences to the $L^2$-Wasserstein probability-density space by introducing transport Bregman divergences $\mathrm{D}_{\mathrm{T},\mathcal{F}}$ and the transport KL divergence $\mathrm{D}_{\mathrm{TKL}}$ as a KL-like object in Wasserstein geometry. It establishes a general formulation via the optimal transport map $T$ and $\nabla_x\Phi_p$, provides multiple equivalent representations (mapping-based and joint-density-based), and derives tractable closed-form expressions in one dimension and for Gaussian families. The paper proves key properties such as nonnegativity under displacement convexity, a transport Hessian metric, linearity, and asymmetry, and introduces a symmetrized transport Jensen–Shannon divergence. These results illuminate how information-geometric notions interact with optimal transport and point to practical applications in AI inference and optimization within Wasserstein spaces.
Abstract
We study Bregman divergences in probability density space embedded with the $L^2$-Wasserstein metric. Several properties and dualities of transport Bregman divergences are provided. In particular, we derive the transport Kullback-Leibler (KL) divergence by a Bregman divergence of negative Boltzmann-Shannon entropy in $L^2$-Wasserstein space. We also derive analytical formulas and generalizations of transport KL divergence for one-dimensional probability densities and Gaussian families.
