Convexity of Mutual Information along the Fokker-Planck Flow
Jiayang Zou, Luyao Fan, Jiayang Gao, Jia Wang
TL;DR
This work analyzes how mutual information evolves along the Fokker-Planck flow, extending known convexity results from heat and OU flows to a broader FP setting. By formulating FP as a Wasserstein gradient flow and deriving the second-time derivative of mutual information, the authors show convexity under conditions where the initial density is sufficiently relatively log-concave with respect to the steady state, with ΔV ≤ (1/2) ||∇V||^2. They establish existence and uniqueness of classical FP solutions, provide Schrödinger-equation-based representations, and derive explicit derivative formulas involving backward Fisher information. The results generalize convexity properties to FP channels, including unbounded domains, and offer a framework for future exploration of weaker assumptions and broader applicability in stochastic analysis and information theory.
Abstract
We study the convexity of mutual information as a function of time along the Fokker-Planck flow. The results are generalizations of that along heat flow and Ornstein-Ulenbeck flow, which were established by A. Wibisono and V. Jog. We prove the existence and uniqueness of the classical solutions to a class of Fokker-Planck equations and then we obtain the second derivative of mutual information along the Fokker-Planck equation. If the initial distribution is sufficiently strongly log-concave compared to the steady state, then mutual information always preserves convexity under suitable conditions. In particular, if there exists some time point at which the distribution is sufficiently strongly log-concave, then mutual information will preserve convexity after that time.
