Provable Convergence and Limitations of Geometric Tempering for Langevin Dynamics
Omar Chehab, Anna Korba, Austin Stromme, Adrien Vacher
TL;DR
This work analyzes geometric tempering for Langevin-based sampling, providing the first KL-based convergence bounds for tempered Langevin dynamics along a general tempering path and revealing that the method can both improve and degrade convergence depending on problem conditioning. It derives continuous- and discrete-time upper bounds that depend on the log-Sobolev constants of intermediate tempered distributions and characterizes an optimal tempering schedule in the strongly log-concave regime. Crucially, the paper shows that geometric tempering can exponentially worsen functional inequalities and, in certain multi- or even unimodal settings, lead to exponential slowdowns in convergence, challenging the assumption that tempering always helps. The results underscore the need for careful choice of tempering paths and schedules and motivate exploring alternative moving-target strategies beyond the geometric path. Overall, the findings provide a nuanced, theory-backed view of when geometric tempering is beneficial and when it may be detrimental for Langevin sampling.
Abstract
Geometric tempering is a popular approach to sampling from challenging multi-modal probability distributions by instead sampling from a sequence of distributions which interpolate, using the geometric mean, between an easier proposal distribution and the target distribution. In this paper, we theoretically investigate the soundness of this approach when the sampling algorithm is Langevin dynamics, proving both upper and lower bounds. Our upper bounds are the first analysis in the literature under functional inequalities. They assert the convergence of tempered Langevin in continuous and discrete-time, and their minimization leads to closed-form optimal tempering schedules for some pairs of proposal and target distributions. Our lower bounds demonstrate a simple case where the geometric tempering takes exponential time, and further reveal that the geometric tempering can suffer from poor functional inequalities and slow convergence, even when the target distribution is well-conditioned. Overall, our results indicate that geometric tempering may not help, and can even be harmful for convergence.
