Accelerating Langevin Monte Carlo Sampling: A Large Deviations Analysis

Nian Yao; Pervez Ali; Xihua Tao; Lingjiong Zhu

Accelerating Langevin Monte Carlo Sampling: A Large Deviations Analysis

Nian Yao, Pervez Ali, Xihua Tao, Lingjiong Zhu

TL;DR

This work develops a unified large deviations framework for generalized Langevin dynamics used in sampling from $\mu(\theta) \propto e^{-U(\theta)}$ in high dimensions. It derives an explicit rate function $I_{\tau}(\nu)$ that decomposes into symmetric and antisymmetric components, enabling direct comparison of convergence speed across overdamped and several variants, including mirror Langevin, high-order Langevin, and Hessian-free high-resolution dynamics. Each variant is shown to admit a tailored rate function (e.g., $I_{M}$, $I_{H}$, $I_{R}$) under hypoellipticity, controllability, and Lyapunov conditions, with Poisson equations linking the antisymmetric parts. The theoretical results are complemented by Bayesian logistic regression experiments on synthetic and real data, demonstrating acceleration or comparable performance for many variants given appropriate hyperparameter choices, thereby informing practical algorithm selection and tuning. Overall, the paper provides a principled, quantitative framework to analyze and compare Langevin-based samplers beyond overdamped dynamics.

Abstract

Langevin algorithms are popular Markov chain Monte Carlo methods that are often used to solve high-dimensional large-scale sampling problems in machine learning. The most classical Langevin Monte Carlo algorithm is based on the overdamped Langevin dynamics. There are many variants of Langevin dynamics that often show superior performance in practice. In this paper, we provide a unified approach to study the acceleration of the variants of the overdamped Langevin dynamics through the lens of large deviations theory. Numerical experiments using both synthetic and real data are provided to illustrate the efficiency of these variants.

Accelerating Langevin Monte Carlo Sampling: A Large Deviations Analysis

TL;DR

This work develops a unified large deviations framework for generalized Langevin dynamics used in sampling from

in high dimensions. It derives an explicit rate function

that decomposes into symmetric and antisymmetric components, enabling direct comparison of convergence speed across overdamped and several variants, including mirror Langevin, high-order Langevin, and Hessian-free high-resolution dynamics. Each variant is shown to admit a tailored rate function (e.g.,

) under hypoellipticity, controllability, and Lyapunov conditions, with Poisson equations linking the antisymmetric parts. The theoretical results are complemented by Bayesian logistic regression experiments on synthetic and real data, demonstrating acceleration or comparable performance for many variants given appropriate hyperparameter choices, thereby informing practical algorithm selection and tuning. Overall, the paper provides a principled, quantitative framework to analyze and compare Langevin-based samplers beyond overdamped dynamics.

Accelerating Langevin Monte Carlo Sampling: A Large Deviations Analysis

TL;DR

Abstract

Accelerating Langevin Monte Carlo Sampling: A Large Deviations Analysis

TL;DR

Abstract

Paper Structure

Table of Contents

Key Result

Figures (4)

Theorems & Definitions (35)