Table of Contents
Fetching ...

Non-asymptotic convergence bounds for modified tamed unadjusted Langevin algorithm in non-convex setting

Ariel Neufeld, Matthew Ng Cheng En, Ying Zhang

TL;DR

The paper develops a non-asymptotic analysis for a modified taming of the Unadjusted Langevin Algorithm (mTULA) to sample from $\pi_\beta \propto e^{-\beta U}$ in high dimensions, even when $U$ is non-convex with super-linearly growing gradients. It introduces a taming factor $h_\lambda(\theta)=\frac{h(\theta)}{\bigl(1+\lambda|\theta|^{2r}\bigr)^{1/2}}$ and proves explicit Wasserstein-1 and Wasserstein-2 bounds with convergence rates $O(\lambda)$ and $O(\lambda^{1/2})$, under a convex-at-infinity condition and polynomial Lipschitz assumptions. The work positions mTULA relative to existing taming approaches by removing the need for global strong convexity while achieving faster discretisation rates, supported by high-dimensional numerical experiments. These results enhance robust, non-asymptotic sampling for non-convex targets and provide guidance on step-size selection in practice.

Abstract

We consider the problem of sampling from a high-dimensional target distribution $π_β$ on $\mathbb{R}^d$ with density proportional to $θ\mapsto e^{-βU(θ)}$ using explicit numerical schemes based on discretising the Langevin stochastic differential equation (SDE). In recent literature, taming has been proposed and studied as a method for ensuring stability of Langevin-based numerical schemes in the case of super-linearly growing drift coefficients for the Langevin SDE. In particular, the Tamed Unadjusted Langevin Algorithm (TULA) was proposed in [Bro+19] to sample from such target distributions with the gradient of the potential $U$ being super-linearly growing. However, theoretical guarantees in Wasserstein distances for Langevin-based algorithms have traditionally been derived assuming strong convexity of the potential $U$. In this paper, we propose a novel taming factor and derive, under a setting with possibly non-convex potential $U$ and super-linearly growing gradient of $U$, non-asymptotic theoretical bounds in Wasserstein-1 and Wasserstein-2 distances between the law of our algorithm, which we name the modified Tamed Unadjusted Langevin Algorithm (mTULA), and the target distribution $π_β$. We obtain respective rates of convergence $\mathcal{O}(λ)$ and $\mathcal{O}(λ^{1/2})$ in Wasserstein-1 and Wasserstein-2 distances for the discretisation error of mTULA in step size $λ$. High-dimensional numerical simulations which support our theoretical findings are presented to showcase the applicability of our algorithm.

Non-asymptotic convergence bounds for modified tamed unadjusted Langevin algorithm in non-convex setting

TL;DR

The paper develops a non-asymptotic analysis for a modified taming of the Unadjusted Langevin Algorithm (mTULA) to sample from in high dimensions, even when is non-convex with super-linearly growing gradients. It introduces a taming factor and proves explicit Wasserstein-1 and Wasserstein-2 bounds with convergence rates and , under a convex-at-infinity condition and polynomial Lipschitz assumptions. The work positions mTULA relative to existing taming approaches by removing the need for global strong convexity while achieving faster discretisation rates, supported by high-dimensional numerical experiments. These results enhance robust, non-asymptotic sampling for non-convex targets and provide guidance on step-size selection in practice.

Abstract

We consider the problem of sampling from a high-dimensional target distribution on with density proportional to using explicit numerical schemes based on discretising the Langevin stochastic differential equation (SDE). In recent literature, taming has been proposed and studied as a method for ensuring stability of Langevin-based numerical schemes in the case of super-linearly growing drift coefficients for the Langevin SDE. In particular, the Tamed Unadjusted Langevin Algorithm (TULA) was proposed in [Bro+19] to sample from such target distributions with the gradient of the potential being super-linearly growing. However, theoretical guarantees in Wasserstein distances for Langevin-based algorithms have traditionally been derived assuming strong convexity of the potential . In this paper, we propose a novel taming factor and derive, under a setting with possibly non-convex potential and super-linearly growing gradient of , non-asymptotic theoretical bounds in Wasserstein-1 and Wasserstein-2 distances between the law of our algorithm, which we name the modified Tamed Unadjusted Langevin Algorithm (mTULA), and the target distribution . We obtain respective rates of convergence and in Wasserstein-1 and Wasserstein-2 distances for the discretisation error of mTULA in step size . High-dimensional numerical simulations which support our theoretical findings are presented to showcase the applicability of our algorithm.
Paper Structure (17 sections, 17 theorems, 176 equations, 1 figure, 1 table)

This paper contains 17 sections, 17 theorems, 176 equations, 1 figure, 1 table.

Key Result

Theorem 2.9

Let Assumptions assumption:1, assumption:2, assumption:3, assumption:4 hold. Then, for any $\lambda\in(0,\tilde{\lambda}_{\max})$, $n\in\mathbb{N}_0$, the mTULA algorithm $(\theta_n^\lambda)_{n\in\mathbb{N}_0}$ has the following non-asymptotic upper bound estimate in Wasserstein-1 distance: where the constants $C_0, C_1, C_2$ are given in (eqn:theorem2.7_constants_C0_C1_C2).

Figures (1)

  • Figure 1: Normalised histograms of first components of samples drawn with mTULA

Theorems & Definitions (29)

  • Remark 2.1
  • Remark 2.2
  • Remark 2.3
  • Remark 2.4
  • Remark 2.5
  • Remark 2.6
  • Remark 2.7
  • Remark 2.8
  • Theorem 2.9
  • Theorem 2.10
  • ...and 19 more