Mean square error analysis of stochastic gradient and variance-reduced sampling algorithms

Jianfeng Lu; Xuda Ye; Zhennan Zhou

Mean square error analysis of stochastic gradient and variance-reduced sampling algorithms

Jianfeng Lu, Xuda Ye, Zhennan Zhou

TL;DR

This work develops a discrete Poisson equation framework to rigorously bound mean square error (MSE) for stochastic-gradient sampling of underdamped Langevin dynamics under global convexity. It proves a first-order convergence for the numerical bias of SG-UBU, with a leading coefficient tied to the stochastic gradient variance, and reveals a phase transition for variance-reduced variants SVRG-UBU and SAGA-UBU, where the bias shifts to second-order as the step size becomes small enough. An empirical criterion is provided to guide the choice between SG-UBU and SVRG-UBU to optimize computational efficiency, supported by numerical experiments. Overall, the discrete Poisson approach offers sharp, modular MSE bounds by separately bounding stability, local errors, and variance-reduction effects, with practical implications for scalable Bayesian sampling and data-assimilation tasks.

Abstract

This paper considers mean square error (MSE) analysis for stochastic gradient sampling algorithms applied to underdamped Langevin dynamics under a global convexity assumption. A novel discrete Poisson equation framework is developed to bound the time-averaged sampling error. For the Stochastic Gradient UBU (SG-UBU) sampler, we derive an explicit MSE bound and establish that the numerical bias exhibits first-order convergence with respect to the step size $h$, with the leading error coefficient proportional to the variance of the stochastic gradient. The analysis is further extended to variance-reduced algorithms for finite-sum potentials, specifically the SVRG-UBU and SAGA-UBU methods. For these algorithms, we identify a phase transition phenomenon whereby the convergence rate of the numerical bias shifts from first to second order as the step size decreases below a critical threshold. Theoretical findings are validated by numerical experiments. In addition, the analysis provides a practical empirical criterion for selecting between the mini-batch SG-UBU and SVRG-UBU samplers to achieve optimal computational efficiency.

Mean square error analysis of stochastic gradient and variance-reduced sampling algorithms

TL;DR

Abstract

Mean square error analysis of stochastic gradient and variance-reduced sampling algorithms

TL;DR

Abstract

Paper Structure

Table of Contents

Key Result

Figures (6)

Theorems & Definitions (59)