Kolmogorov-Type Maximal Inequalities for Independent and Dependent Negative Binomial Random Variables: Sharp Bounds, Sub-Exponential Refinements, and Applications to Overdispersed Count Data
Aristides V. Doumas, S. Spektor
TL;DR
This work develops Kolmogorov-type maximal inequalities for sums of Negative Binomial variables under both independence and Poisson-Gamma dependent structures. The authors derive a sharp Markov-type deviation bound for independent NB variables and a Kolmogorov-type bound with explicit Tweedie NB2 control limits, linking dispersion to GLM surveillance through $\mathrm{Var}(X_i)=\mu_i+\kappa_i\mu_i^2$. For dependent counts from a shared Gamma mixing variable, they introduce a basic Kolmogorov bound and a novel sub-exponential Bernstein-type refinement that exploits the hierarchical Poisson-Gamma structure to achieve exponential tail decay. Moment-matched Monte Carlo experiments show a 55% reduction in mean maximum deviation under dependence, with an analytical explanation grounded in the shared latent factor. An epidemiological application using NB2 parameters calibrated from COVID-19 data demonstrates the practical utility of these maximal-inequality results for monitoring overdispersed counts in public health.
Abstract
This paper develops Kolmogorov-type maximal inequalities for sums of Negative Binomial random variables under both independence and dependence structures. For independent heterogeneous Negative Binomial variables we derive sharp Markov-type deviation inequalities and Kolmogorov-type bounds expressed in terms of Tweedie dispersion parameters, providing explicit control limits for NB2 generalized linear model monitoring. For dependent count data arising through a shared Gamma mixing variable, we establish a \emph{sub-exponential Bernstein-type refinement} that exploits the Poisson-Gamma hierarchical structure to yield exponentially decaying tail probabilities -- this refinement is new in the literature. Through moment-matched Monte Carlo experiments ($n=20$, 2{,}000 replications), we document a 55\% reduction in mean maximum deviation under appropriate dependence structures, a stabilization effect we explain analytically. A concrete epidemiological application with NB2 parameters calibrated from COVID-19 surveillance data demonstrates practical utility. These results materially advance the applicability of classical maximal inequalities to overdispersed and dependent count data prevalent in public health, insurance, and ecological modeling.
