Generalized Pinsker Inequality for Bregman Divergences of Negative Tsallis Entropies
Guglielmo Beretta, Tommaso Cesari, Roberto Colomboni
TL;DR
This work derives a sharp Pinsker-type inequality for the Bregman divergences $D_\alpha(p\|q)$ generated by the negative $\alpha$-Tsallis entropies on the probability simplex. It provides explicit optimal constants $C_{\alpha,K}$ across regimes that depend on $\alpha$ and the dimension $K$, revealing dimension-free behavior for $\alpha\le1$ and phase transitions at $\alpha=2$ with parity effects in $K$. For multiclass problems with $K\ge3$, a uniform bound exists only when $\alpha\le2$, while in the binary case $K=2$ a Pinsker-type bound holds for all $\alpha>2$. The results connect Tsallis losses to $\beta$-divergences, recover the classical KL Pinsker bound at $\alpha=1$, and yield the $L_1$-strong convexity constant of $-S_\alpha$, providing practical implications for online learning and surrogate-to-0-1 regret analyses in multiclass settings.
Abstract
The Pinsker inequality lower bounds the Kullback--Leibler divergence $D_{\textrm{KL}}$ in terms of total variation and provides a canonical way to convert $D_{\textrm{KL}}$ control into $\lVert \cdot \rVert_1$-control. Motivated by applications to probabilistic prediction with Tsallis losses and online learning, we establish a generalized Pinsker inequality for the Bregman divergences $D_α$ generated by the negative $α$-Tsallis entropies -- also known as $β$-divergences. Specifically, for any $p$, $q$ in the relative interior of the probability simplex $Δ^K$, we prove the sharp bound \[ D_α(p\Vert q) \ge \frac{C_{α,K}}{2}\cdot \|p-q\|_1^2, \] and we determine the optimal constant $C_{α,K}$ explicitly for every choice of $(α,K)$.
