Time-complexity of sampling from a multimodal distribution using sequential Monte Carlo
Ruiyu Han, Gautam Iyer, Dejan Slepčev
TL;DR
This work develops a rigorous analysis of Annealed Sequential Monte Carlo (ASMC) for sampling from Gibbs measures at low temperature in non-convex landscapes, using geometric annealing and Langevin steps at each level. By exploiting local valley mixing and a carefully designed resampling scheme, the authors prove that ASMC estimators converge with time complexity that is polynomial in the inverse temperature and the desired accuracy, with a distinctive fourth-power scaling in $1/eta$ and a squared dependence on $1/ ext{err}$ in certain regimes. They establish two main model frameworks: a Local Mixing Model with explicit constants and a double-well energy scenario on the torus, the latter analyzed via spectral properties of the Langevin generator to obtain polynomial bounds under nondegeneracy and mass-ratio assumptions. The paper also provides numerical experiments illustrating the level-wise mass balancing and error behavior, and surveys related tempering and AIS/SMC methods to position ASMC as a dimension-independent, structure-agnostic approach to multimodal sampling. Overall, the results suggest a practically efficient route to low-temperature sampling in multimodal settings where global mixing is prohibitively slow.
Abstract
We study a sequential Monte Carlo algorithm to sample from the Gibbs measure with a non-convex energy function at a low temperature. We use the practical and popular geometric annealing schedule, and use a Langevin diffusion at each temperature level. The Langevin diffusion only needs to run for a time that is long enough to ensure local mixing within energy valleys, which is much shorter than the time required for global mixing. Our main result shows convergence of Monte Carlo estimators with time complexity that, approximately, scales like the fourth power of the inverse temperature, and the square of the inverse allowed error. We also study this algorithm in an illustrative model scenario where more explicit estimates can be given.
