Codivergences and information matrices
Alexis Derumigny, Johannes Schmidt-Hieber
TL;DR
This work introduces codivergence as a local, bilinear notion of angle between probability measures around a reference $P_0$, enabling directional comparison of $P_1$ and $P_2$. It defines covariance-type $V_\phi$ and correlation-type $R_\phi$ codivergences, with central instances including the $χ^2$-codivergence and the Hellinger codivergence, and shows these induce divergence matrices that closely resemble Gram matrices of a tangent-like space. The authors derive explicit $R_α$ expressions for common parametric families—multivariate normal, Poisson, Bernoulli, and Gamma—and establish a data-processing inequality for the $χ^2$-divergence matrix, highlighting robustness under Markov kernels. They also analyze rank properties and connect the local codivergence structure to the nonparametric Fisher information metric, offering a versatile framework for lower bounds and information-geometry-inspired analysis in both finite and infinite-dimensional settings.
Abstract
We propose a new concept of codivergence, which quantifies the similarity between two probability measures $P_1, P_2$ relative to a reference probability measure $P_0$. In the neighborhood of the reference measure $P_0$, a codivergence behaves like an inner product between the measures $P_1 - P_0$ and $P_2 - P_0$. Codivergences of covariance-type and correlation-type are introduced and studied with a focus on two specific correlation-type codivergences, the $χ^2$-codivergence and the Hellinger codivergence. We derive explicit expressions for several common parametric families of probability distributions. For a codivergence, we introduce moreover the divergence matrix as an analogue of the Gram matrix. It is shown that the $χ^2$-divergence matrix satisfies a data-processing inequality.
