Tight Non-asymptotic Inference via Sub-Gaussian Intrinsic Moment Norm
Huiming Zhang, Haoyu Wei, Guang Cheng
TL;DR
This work addresses tight non-asymptotic inference under sub-Gaussian data by introducing the intrinsic moment norm $\|X\|_G$, which tightly approximates MGF bounds and enables sharper concentration than traditional variance proxies. It develops practical estimation methods, including a plug-in estimator and robust MOM-based estimators, along with small-sample techniques such as leave-one-out Hodges-Lehmann methods, all coordinated through the novel sub-Gaussian plot to diagnose sub-Gaussianity. The authors extend these tools to reinforcement learning, notably a BeUCB algorithm for multi-armed bandits with provable regret bounds that match minimax rates up to constants. Overall, the approach provides a robust, estimable, and theoretically sharp framework for non-asymptotic uncertainty quantification in sub-Gaussian settings with concrete applications to bandits and RL.
Abstract
In non-asymptotic learning, variance-type parameters of sub-Gaussian distributions are of paramount importance. However, directly estimating these parameters using the empirical moment generating function (MGF) is infeasible. To address this, we suggest using the sub-Gaussian intrinsic moment norm [Buldygin and Kozachenko (2000), Theorem 1.3] achieved by maximizing a sequence of normalized moments. Significantly, the suggested norm can not only reconstruct the exponential moment bounds of MGFs but also provide tighter sub-Gaussian concentration inequalities. In practice, we provide an intuitive method for assessing whether data with a finite sample size is sub-Gaussian, utilizing the sub-Gaussian plot. The intrinsic moment norm can be robustly estimated via a simple plug-in approach. Our theoretical findings are also applicable to reinforcement learning, including the multi-armed bandit scenario.
