Optimal sub-Gaussian variance proxy for 3-mass distributions
Soufiane Atouani, Olivier Marchal, Julyan Arbel
TL;DR
This work develops a general, variation-based characterization of the optimal sub-Gaussian variance proxy $\sigma^2_{ ext{opt}}$ for real-valued random variables, focusing on discrete, equidistant distributions. Central to the method is the function $g_Y(\lambda;\sigma^2)=\tfrac{1}{2}\lambda^2\sigma^2-M_Y(\lambda)$ and the equation system $g_Y(\lambda;\sigma^2)=0$, $g_Y'(\lambda;\sigma^2)=0$, which reduces the search to zeros of $\lambda M_Y'(\lambda)-2M_Y(\lambda)=0$ and verification of local minimality. The paper derives a complete picture for 3-mass distributions: a phase transition at $p=1/6$ in the symmetric case where $\sigma^2_{ ext{opt}}=\operatorname{Var}[Y]$ for $p\geq 1/6$, and explicit closed forms in several asymmetric regimes, with a nonlinear-equation characterization when the central mass is large. It also proves that the discrete uniform distribution on $N$ points is strictly sub-Gaussian with $\sigma^2_{ ext{opt}}=\operatorname{Var}[X]$, and provides an open-source Python package that blends analytic and numerical techniques to compute optimal variance proxies for a wide class of distributions, aiding probabilistic tail control and Bayesian analysis. Together, these results advance discrete tail analysis and offer practical tools for sharp concentration bounds. $\sigma^2_{ ext{opt}}$ is thus connected to the cumulant structure via $M_Y$, enabling precise tailoring of concentration guarantees in discrete settings.
Abstract
We investigate the problem of characterizing the optimal variance proxy for sub-Gaussian random variables,whose moment-generating function exhibits bounded growth at infinity. We apply a general characterization method to discrete random variables with equally spaced atoms. We thoroughly study 3-mass distributions, thereby generalizing the well-studied Bernoulli case. We also prove that the discrete uniform distribution over $N$ points is strictly sub-Gaussian. Finally, we provide an open-source Python package that combines analytical and numerical approaches to compute optimal sub-Gaussian variance proxies across a wide range of distributions.
