Sampling from multi-modal distributions with polynomial query complexity in fixed dimension via reverse diffusion
Adrien Vacher, Omar Chehab, Anna Korba
TL;DR
This work tackles the challenge of drawing samples from unnormalized, multi-modal densities μ ∝ e^{-V} in fixed dimension. It introduces a reverse-diffusion based sampler that converts sampling into score estimation along the forward Ornstein–Uhlenbeck process and uses a self-normalized Monte Carlo estimator to approximate intermediate scores from access to V only. Under semi-log-convexity and dissipativity, the authors prove a non-asymptotic, polynomial-time guarantee for obtaining samples with small KL error, without requiring prior knowledge of problem constants and without metastability. They further show that general Gaussian mixtures satisfy the required regularity, yielding practical polynomial bounds in mixture-conditioned parameters, and demonstrate favorable empirical performance against standard baselines. The results bridge diffusion-based sampling theory with concrete guarantees for low-dimensional multi-modality, with potential impact on Bayesian inference and probabilistic modeling in settings where multi-modal posteriors are common.
Abstract
Even in low dimensions, sampling from multi-modal distributions is challenging. We provide the first sampling algorithm for a broad class of distributions -- including all Gaussian mixtures -- with a query complexity that is polynomial in the parameters governing multi-modality, assuming fixed dimension. Our sampling algorithm simulates a time-reversed diffusion process, using a self-normalized Monte Carlo estimator of the intermediate score functions. Unlike previous works, it avoids metastability, requires no prior knowledge of the mode locations, and relaxes the well-known log-smoothness assumption which excluded general Gaussian mixtures so far.
