Estimation of discrete distributions with high probability under $χ^2$-divergence
Sirine Louati
TL;DR
The paper studies high-probability estimation of a discrete distribution under $\chi^2$-divergence using i.i.d. samples. It analyzes the classical Laplace (add-one) estimator, deriving sharp upper and lower bounds, and introduces a confidence-dependent smoothing scheme to improve rates. The authors show that the Laplace estimator is minimax optimal among confidence-independent estimators, and they characterize minimax optimal rates with and without confidence calibration, revealing an intrinsic overhead in non-asymptotic guarantees. These results yield practical smoothing strategies and tighten theoretical understanding of divergence-based estimation in high dimensions.
Abstract
We investigate the high-probability estimation of discrete distributions from an \iid sample under $χ^2$-divergence loss. Although the minimax risk in expectation is well understood, its high-probability counterpart remains largely unexplored. We provide sharp upper and lower bounds for the classical Laplace estimator, showing that it achieves optimal performance among estimators that do not rely on the confidence level. We further characterize the minimax high-probability risk for any estimator and demonstrate that it can be attained through a simple smoothing strategy. Our analysis highlights an intrinsic separation between asymptotic and non-asymptotic guarantees, with the latter suffering from an unavoidable overhead. This work sharpens existing guarantees and advances the theoretical understanding of divergence-based estimation.
