Asymptotically Optimal Sequential Testing with Markovian Data
Alhad Sethi, Kavali Sofia Sagar, Shubhada Agrawal, Debabrota Basu, P. N. Karthik
TL;DR
The paper introduces a rigorously developed framework for one-sided sequential hypothesis testing with Markovian data under composite null and composite alternative hypotheses. It establishes a non-asymptotic, instance-dependent lower bound on the stopping time that scales with $\log(1/α)$ divided by the stationary-weighted KL divergence $D_{\mathcal{M}}^{\inf}(Q, 𝒫)$, and provides an asymptotically optimal test that attains this bound to first order as $α\to 0$. Central technical tools include a Pinsker-type inequality for Markov divergences via the Poisson equation and uniform control of Poisson solutions, enabling sharp finite-sample guarantees. The framework is demonstrated through applications to MCMC misspecification detection and to testing linearity in MDPs, illustrating both theoretical sharpness and practical applicability. The work thus yields a sharp, general characterization of optimal sequential testing procedures under Markovian dependence and opens avenues for extension to broader dependence models and non-convex hypothesis classes.
Abstract
We study one-sided and $α$-correct sequential hypothesis testing for data generated by an ergodic Markov chain. The null hypothesis is that the unknown transition matrix belongs to a prescribed set $P$ of stochastic matrices, and the alternative corresponds to a disjoint set $Q$. We establish a tight non-asymptotic instance-dependent lower bound on the expected stopping time of any valid sequential test under the alternative. Our novel analysis improves the existing lower bounds, which are either asymptotic or provably sub-optimal in this setting. Our lower bound incorporates both the stationary distribution and the transition structure induced by the unknown Markov chain. We further propose an optimal test whose expected stopping time matches this lower bound asymptotically as $α\to 0$. We illustrate the usefulness of our framework through applications to sequential detection of model misspecification in Markov Chain Monte Carlo and to testing structural properties, such as the linearity of transition dynamics, in Markov decision processes. Our findings yield a sharp and general characterization of optimal sequential testing procedures under Markovian dependence.
