Statistical inference for Linear Stochastic Approximation with Markovian Noise
Sergey Samsonov, Marina Sheshukova, Eric Moulines, Alexey Naumov
TL;DR
This work derives non-asymptotic Berry–Esseen bounds for Polyak–Ruppert averaged iterates in Linear Stochastic Approximation under Markov noise, achieving a Kolmogorov distance rate of $O(n^{-1/4})$ to the Gaussian limit. The authors employ a Poisson decomposition and martingale Berry–Esseen techniques to bound the projected error and establish a finite-sample distributional approximation for LSA. They also prove non-asymptotic validity of a multiplier subsample bootstrap for constructing confidence intervals, with a coverage-approximation rate of $O(n^{-1/10})$, and recover the $O(n^{-1/8})$ rate for the overlapping batch means estimator of the asymptotic variance. The methodology applies to TD learning and informs practical choices for step sizes and bootstrap tuning, enabling reliable uncertainty quantification in SA with Markov noise. Potential extensions include multivariate, nonlinear SA and refined constants.
Abstract
In this paper we derive non-asymptotic Berry-Esseen bounds for Polyak-Ruppert averaged iterates of the Linear Stochastic Approximation (LSA) algorithm driven by the Markovian noise. Our analysis yields $\mathcal{O}(n^{-1/4})$ convergence rates to the Gaussian limit in the Kolmogorov distance. We further establish the non-asymptotic validity of a multiplier block bootstrap procedure for constructing the confidence intervals, guaranteeing consistent inference under Markovian sampling. Our work provides the first non-asymptotic guarantees on the rate of convergence of bootstrap-based confidence intervals for stochastic approximation with Markov noise. Moreover, we recover the classical rate of order $\mathcal{O}(n^{-1/8})$ up to logarithmic factors for estimating the asymptotic variance of the iterates of the LSA algorithm.
