Tamed Langevin sampling under weaker conditions
Iosif Lytras, Panayotis Mertikopoulos
TL;DR
This work addresses sampling from targets with non-Lipschitz log-gradients under weak dissipativity by introducing two taming schemes, wd-TULA and reg-TULA, that work under $PI$ (with and without weak convexity) and $LSI$ regimes. The authors derive non-asymptotic convergence guarantees in KL divergence, total variation, and Wasserstein distances, using a differential-inequality framework and isoperimetric inequalities, while achieving polynomial dependence on dimension under $PI$ and optimal rates under $LSI$. A regularized taming approach extends the results to the PI-only setting by constructing a regularized target that satisfies a Poincaré inequality and, under suitable conditions, a Log-Sobolev inequality. Numerical experiments on a high-dimensional double-well potential corroborate the theoretical findings and demonstrate the practical stability of wd-TULA compared to vanilla ULA and standard TULA.
Abstract
Motivated by applications to deep learning which often fail standard Lipschitz smoothness requirements, we examine the problem of sampling from distributions that are not log-concave and are only weakly dissipative, with log-gradients allowed to grow superlinearly at infinity. In terms of structure, we only assume that the target distribution satisfies either a log-Sobolev or a Poincaré inequality and a local Lipschitz smoothness assumption with modulus growing possibly polynomially at infinity. This set of assumptions greatly exceeds the operational limits of the "vanilla" unadjusted Langevin algorithm (ULA), making sampling from such distributions a highly involved affair. To account for this, we introduce a taming scheme which is tailored to the growth and decay properties of the target distribution, and we provide explicit non-asymptotic guarantees for the proposed sampler in terms of the Kullback-Leibler (KL) divergence, total variation, and Wasserstein distance to the target distribution.
