User-friendly guarantees for the Langevin Monte Carlo with inaccurate gradient
Arnak S. Dalalyan, Avetik G. Karagulyan
TL;DR
The paper addresses nonasymptotic sampling guarantees for Langevin Monte Carlo (LMC) targeting smooth, strongly log-concave densities π(θ) ∝ e^{-f(θ)} by analyzing Wasserstein-2 convergence. It develops user-friendly bounds for LMC with both constant and varying step sizes, extends guarantees to inaccurate gradient evaluations, and introduces second-order variants LMCO and LMCO' that leverage Hessian information to achieve improved rates in ill-conditioned regimes. The results cover mixtures of log-concave components (MLMC) and demonstrate that gradient noise can be tolerated with explicit bias and variance terms, while still yielding dimension-aware convergence rates. The work also clarifies connections between sampling and optimization, showing how diffusion-based sampling recovers gradient-descent and Newton-method behaviors in appropriate limits, and suggests practical guidelines for choosing step-sizes and iteration counts in high-dimensional settings. Overall, the contributions offer practical, nearly dimension-optimal guarantees for a broad family of Langevin-based samplers, including robust handling of approximate gradients and second-order discretizations.
Abstract
In this paper, we study the problem of sampling from a given probability density function that is known to be smooth and strongly log-concave. We analyze several methods of approximate sampling based on discretizations of the (highly overdamped) Langevin diffusion and establish guarantees on its error measured in the Wasserstein-2 distance. Our guarantees improve or extend the state-of-the-art results in three directions. First, we provide an upper bound on the error of the first-order Langevin Monte Carlo (LMC) algorithm with optimized varying step-size. This result has the advantage of being horizon free (we do not need to know in advance the target precision) and to improve by a logarithmic factor the corresponding result for the constant step-size. Second, we study the case where accurate evaluations of the gradient of the log-density are unavailable, but one can have access to approximations of the aforementioned gradient. In such a situation, we consider both deterministic and stochastic approximations of the gradient and provide an upper bound on the sampling error of the first-order LMC that quantifies the impact of the gradient evaluation inaccuracies. Third, we establish upper bounds for two versions of the second-order LMC, which leverage the Hessian of the log-density. We provide nonasymptotic guarantees on the sampling error of these second-order LMCs. These guarantees reveal that the second-order LMC algorithms improve on the first-order LMC in ill-conditioned settings.
