On Confidence Sequences for Bounded Random Processes via Universal Gambling Strategies
J. Jon Ryu, Alankrita Bhatt
TL;DR
This work advances time-uniform confidence sequences for the mean of bounded processes by reframing the problem as a gambling game over a two-horse race. It formalizes a principled mixture approach that closely emulates Cover's universal portfolio with constant per-round complexity, and introduces a higher-order lower-bound technique (LBUP) to achieve near-UP performance with scalable computation. The paper provides exact mixture-wealth formulations, proves interval-type confidence sets via Ville’s inequality, and demonstrates through experiments that the proposed methods yield tight, practically useful confidence sequences, including a hybrid strategy that blends UP and LBUP for the best of both worlds. The results have practical impact for sequential mean estimation under boundedness, offering efficient, robust tools for time-adaptive inference and safe stopping rules in online settings.
Abstract
This paper considers the problem of constructing a confidence sequence, which is a sequence of confidence intervals that hold uniformly over time, for estimating the mean of bounded real-valued random processes. This paper revisits the gambling-based approach established in the recent literature from a natural \emph{two-horse race} perspective, and demonstrates new properties of the resulting algorithm induced by Cover (1991)'s universal portfolio. The main result of this paper is a new algorithm based on a mixture of lower bounds, which closely approximates the performance of Cover's universal portfolio with constant per-round time complexity. A higher-order generalization of a lower bound on a logarithmic function in (Fan et al., 2015), which is developed as a key technique for the proposed algorithm, may be of independent interest.
