Accelerating Langevin Monte Carlo Sampling: A Large Deviations Analysis
Nian Yao, Pervez Ali, Xihua Tao, Lingjiong Zhu
TL;DR
This work develops a unified large deviations framework for generalized Langevin dynamics used in sampling from $\mu(\theta) \propto e^{-U(\theta)}$ in high dimensions. It derives an explicit rate function $I_{\tau}(\nu)$ that decomposes into symmetric and antisymmetric components, enabling direct comparison of convergence speed across overdamped and several variants, including mirror Langevin, high-order Langevin, and Hessian-free high-resolution dynamics. Each variant is shown to admit a tailored rate function (e.g., $I_{M}$, $I_{H}$, $I_{R}$) under hypoellipticity, controllability, and Lyapunov conditions, with Poisson equations linking the antisymmetric parts. The theoretical results are complemented by Bayesian logistic regression experiments on synthetic and real data, demonstrating acceleration or comparable performance for many variants given appropriate hyperparameter choices, thereby informing practical algorithm selection and tuning. Overall, the paper provides a principled, quantitative framework to analyze and compare Langevin-based samplers beyond overdamped dynamics.
Abstract
Langevin algorithms are popular Markov chain Monte Carlo methods that are often used to solve high-dimensional large-scale sampling problems in machine learning. The most classical Langevin Monte Carlo algorithm is based on the overdamped Langevin dynamics. There are many variants of Langevin dynamics that often show superior performance in practice. In this paper, we provide a unified approach to study the acceleration of the variants of the overdamped Langevin dynamics through the lens of large deviations theory. Numerical experiments using both synthetic and real data are provided to illustrate the efficiency of these variants.
