Decoupled Functional Central Limit Theorems for Two-Time-Scale Stochastic Approximation
Yuze Han, Xiang Li, Jiadong Liang, Zhihua Zhang
TL;DR
This work establishes decoupled functional central limit theorems for two-time-scale stochastic approximation, showing that rescaled trajectories of the fast and slow updates converge to stationary Ornstein–Uhlenbeck processes with dynamics governed by linear generators and Lyapunov covariances. The authors construct continuous-time trajectories by rescaling the inner and outer errors by the square roots of their step sizes and introduce an auxiliary sequence to mitigate cross-scale interference, enabling a clean diffusion limit. The results extend classical CLTs for SA to trajectory-level FCLTs in a two-time-scale setting and recover standard CLTs in special cases, with implications for precise stochastic behavior in bilevel optimization, TD learning, and related algorithms. The approach hinges on tightness, martingale problems, and careful handling of residual terms, providing a framework for online inference and future generalizations to Markovian or more intricate noise structures.
Abstract
In two-time-scale stochastic approximation (SA), two iterates are updated at different rates, governed by distinct step sizes, with each update influencing the other. Previous studies have demonstrated that the convergence rates of the error terms for these updates depend solely on their respective step sizes, a property known as decoupled convergence. However, a functional version of this decoupled convergence has not been explored. Our work fills this gap by establishing decoupled functional central limit theorems for two-time-scale SA, offering a more precise characterization of its asymptotic behavior. To achieve these results, we leverage the martingale problem approach and establish tightness as a crucial intermediate step. Furthermore, to address the interdependence between different time scales, we introduce an innovative auxiliary sequence to eliminate the primary influence of the fast-time-scale update on the slow-time-scale update.
