The strong data processing inequality under the heat flow
Bo'az Klartag, Or Ordentlich
TL;DR
The paper analyzes how divergences between two distributions evolve under the heat flow induced by Gaussian noise, formalizing a strong data-processing inequality with contraction coefficients η_φ(μ,s). By connecting η_χ^2 and η_KL to functional-inequality constants C_P(μ) and C_LS(μ), respectively, the authors derive explicit bounds and convexity properties, generalize de Bruijn identities to φ-divergences, and establish MMSE- and I-MMSE-based bounds that link estimation error to information loss. They introduce φ-Sobolev constants to unify contraction bounds across a broad family of divergences and prove subadditivity and Gaussian-extremality results that anchor the Gaussian case as a benchmark. The results yield practical insights for the Gaussian channel Y = X + √s Z, including half-blurring times and information-theoretic bounds that tighten previous EPI-based estimates, with implications for estimation, concentration, and information geometry. Overall, the work advances a cohesive framework tying heat-flow evolution, SDPI, and functional inequalities to quantitative bounds on information measures under Gaussian perturbations.
Abstract
Let $ν$ and $μ$ be probability distributions on $\mathbb{R}^n$, and $ν_s,μ_s$ be their evolution under the heat flow, that is, the probability distributions resulting from convolving their density with the density of an isotropic Gaussian random vector with variance $s$ in each entry. This paper studies the rate of decay of $s\mapsto D(ν_s\|μ_s)$ for various divergences, including the $χ^2$ and Kullback-Leibler (KL) divergences. We prove upper and lower bounds on the strong data-processing inequality (SDPI) coefficients corresponding to the source $μ$ and the Gaussian channel. We also prove generalizations of de Brujin's identity, and Costa's result on the concavity in $s$ of the differential entropy of $ν_s$. As a byproduct of our analysis, we obtain new lower bounds on the mutual information between $X$ and $Y=X+\sqrt{s} Z$, where $Z$ is a standard Gaussian vector in $\mathbb{R}^n$, independent of $X$, and on the minimum mean-square error (MMSE) in estimating $X$ from $Y$, in terms of the Poincaré constant of $X$.
