Upper and lower bounds on TVD and KLD between centered elliptical distributions in high-dimensional setting
Ievlev Pavel, Timofei Shashkov
TL;DR
This work develops high‑dimensional bounds on the total variation distance $D_{\,mathrm{TV}}$ and the Kullback–Leibler divergence $D_{\,mathrm{KL}}$ between centered elliptical distributions. A covariance‑reduction approach reduces general comparisons to the identity and diagonal covariance cases, enabling bounds expressed through incomplete Gamma/Beta functions and the Lambert $W$‑function. The authors instantiate these bounds for the multivariate Student‑t versus multivariate normal case, and for independent‑component Gamma laws (the latter focusing on TVD), providing explicit formulas and computable thresholds that hold in large $n$. The results facilitate robust, tractable comparisons in high dimensions when exact densities are unavailable, with potential implications for hypothesis testing, privacy, and model selection. The methodology hinges on $f$‑divergence techniques, sublevel set analyses, and CLT‑style approximations, yielding practically usable bounds even as the dimension grows.
Abstract
In this paper, we derive some upper and lower bounds and inequalities for the total variation distance (TVD) and the Kullback-Leibler divergence (KLD), also known as the relative entropy, between two probability measures $μ$ and $ν$ defined by $$ D_{\mathrm{TV}} ( μ, ν) = \sup_{B \in \mathcal{B} (\mathbb{R}^n)} \left| μ(B) - ν(B) \right| \quad \text{and} \quad D_{\mathrm{KL}} ( μ\, \| \, ν) = \int_{\mathbb{R}^n} \ln \left( \frac{dμ(x)}{dν(x)} \right) \, μ(dx) $$ correspondingly when the dimension $n$ is high. We begin with some elementary bounds for centered elliptical distributions admitting densities and showcase how these bounds may be used by estimating the TVD and KLD between multivariate Student and multivariate normal distribution in the high-dimensional setting. Next, we show how the same approach simplifies when we apply it to multivariate Gamma distributions with independent components (in the latter case, we only study the TVD, because KLD may be calculated explicitly, see [1]). Our approach is motivated by the recent contribution by Barabesi and Pratelli [2].
