Projected Langevin Monte Carlo algorithms in non-convex and super-linear setting

Chenxu Pang; Xiaojie Wang; Yue Wu

Projected Langevin Monte Carlo algorithms in non-convex and super-linear setting

Chenxu Pang, Xiaojie Wang, Yue Wu

TL;DR

An explicit projected Langevin Monte Carlo (PLMC) algorithm with non-convex potential $U$ and super-linear gradient of $U$ is proposed and the non-asymptotic analysis of its sampling error in total variation distance is investigated and the smallest number of iterations of the PLMC algorithm is proved to be of order.

Abstract

It is of significant interest in many applications to sample from a high-dimensional target distribution $π$ with the density $π(\text{d} x) \propto e^{-U(x)} (\text{d} x) $, based on the temporal discretization of the Langevin stochastic differential equations (SDEs). In this paper, we propose an explicit projected Langevin Monte Carlo (PLMC) algorithm with non-convex potential $U$ and super-linear gradient of $U$ and investigate the non-asymptotic analysis of its sampling error in total variation distance. Equipped with time-independent regularity estimates for the associated Kolmogorov equation, we derive the non-asymptotic bounds on the total variation distance between the target distribution of the Langevin SDEs and the law induced by the PLMC scheme with order $\mathcal{O}(d^{\max\{3γ/2 , 2γ-1 \}} h |\ln h|)$, where $d$ is the dimension of the target distribution and $γ\geq 1$ characterizes the growth of the gradient of $U$. In addition, if the gradient of $U$ is globally Lipschitz continuous, an improved convergence order of $\mathcal{O}(d^{3/2} h)$ for the classical Langevin Monte Carlo (LMC) scheme is derived with a refinement of the proof based on Malliavin calculus techniques. To achieve a given precision $ε$, the smallest number of iterations of the PLMC algorithm is proved to be of order ${\mathcal{O}}\big(\tfrac{d^{\max\{3γ/2 , 2γ-1 \}}}ε \ \cdot \ln (\tfrac{d}ε) \cdot \ln (\tfrac{1}ε) \big)$. In particular, the classical Langevin Monte Carlo (LMC) scheme with the non-convex potential $U$ and the globally Lipschitz gradient of $U$ can be guaranteed by order ${\mathcal{O}}\big(\tfrac{d^{3/2}}ε \cdot \ln (\tfrac{1}ε) \big)$. Numerical experiments are provided to confirm the theoretical findings.

Projected Langevin Monte Carlo algorithms in non-convex and super-linear setting

TL;DR

An explicit projected Langevin Monte Carlo (PLMC) algorithm with non-convex potential

and super-linear gradient of

is proposed and the non-asymptotic analysis of its sampling error in total variation distance is investigated and the smallest number of iterations of the PLMC algorithm is proved to be of order.

Abstract

It is of significant interest in many applications to sample from a high-dimensional target distribution

with the density

, based on the temporal discretization of the Langevin stochastic differential equations (SDEs). In this paper, we propose an explicit projected Langevin Monte Carlo (PLMC) algorithm with non-convex potential

and super-linear gradient of

and investigate the non-asymptotic analysis of its sampling error in total variation distance. Equipped with time-independent regularity estimates for the associated Kolmogorov equation, we derive the non-asymptotic bounds on the total variation distance between the target distribution of the Langevin SDEs and the law induced by the PLMC scheme with order

, where

is the dimension of the target distribution and

characterizes the growth of the gradient of

. In addition, if the gradient of

is globally Lipschitz continuous, an improved convergence order of

for the classical Langevin Monte Carlo (LMC) scheme is derived with a refinement of the proof based on Malliavin calculus techniques. To achieve a given precision

, the smallest number of iterations of the PLMC algorithm is proved to be of order

. In particular, the classical Langevin Monte Carlo (LMC) scheme with the non-convex potential

and the globally Lipschitz gradient of

can be guaranteed by order

. Numerical experiments are provided to confirm the theoretical findings.

Paper Structure (19 sections, 16 theorems, 191 equations, 4 figures, 5 tables)

This paper contains 19 sections, 16 theorems, 191 equations, 4 figures, 5 tables.

Introduction
Settings and main results
Preliminary results
A priori estimates of the Langevin SDE
A priori estimates of the PLMC algorithm
Kolmogorov equation and regularization estimates
Proof of Theorem \ref{['theorem:main-result-paper3']}: time-independent weak error analysis
Optimal convergence rate for the Lipschitz case $\gamma = 1$
Introduction to Malliavin calculus
Proof of Theorem \ref{['theorem:main-result-paper3-optimal']}: optimal weak convergence rate
Numerical experiments
Proof of Lemmas in Section \ref{['section:Preliminary-results-paper3']}
Proof of Lemma \ref{['lemma:Uniform-moment-bounds-of-the-Langevin-SDE-paper3']}
Proof of lemmas in Section \ref{['section:Kolmogorov-equation-and-regularization-estimates']}
Proof of Lemma \ref{['lemma:differentiability-of-solutions-paper3']}
...and 4 more sections

Key Result

Theorem 2.6

(Main result: non-asymptotic bounds in the total variation distance) Assume Assumptions assumption:globally-polynomial-growth-condition-paper3, assumption:contractivity-at-infinity-condition-paper3, assumption:coercivity-condition-of-the-drift-paper3. Let $\{X^{x_{0}}_{t}\}_{t \geq 0}$ and $\{Y^{x_{

Figures (4)

Figure 1: Probability density of the first component of the double-well model.
Figure 2: Weak convergence rates of PLMC algorithm of the double well model for $d=6$ (Top) and $d=10$ (Bottom).
Figure 3: Weak convergence rates of PLMC algorithm of the double well model for $d=50$ (Top) and $d=100$ (Bottom).
Figure 4: Dimension dependence of weak errors of PLMC algorithm.

Theorems & Definitions (35)

Remark 2.3
Example 2.5: Double-well potential
Theorem 2.6
Theorem 2.7
Remark 2.8
Proposition 2.9
Lemma 3.1
Lemma 3.2
proof : Proof of Lemma \ref{['lemma:Existence-and-uniqueness-of-the-invariant-measure-paper3']}
Lemma 3.3
...and 25 more

Projected Langevin Monte Carlo algorithms in non-convex and super-linear setting

TL;DR

Abstract

Projected Langevin Monte Carlo algorithms in non-convex and super-linear setting

Authors

TL;DR

Abstract

Table of Contents

Key Result

Figures (4)

Theorems & Definitions (35)