Sampling, Diffusions, and Stochastic Localization
Andrea Montanari
TL;DR
This work presents a unified framework that connects diffusion-based sampling with stochastic localization, showing how a broad class of observation-driven processes (Y_t) yields martingale posterior measures μ_t that converge to the target μ. By recasting reverse-diffusion sampling as stochastic localization, it derives many concrete schemes (isotropic/anisotropic Gaussian, erasure, binary/symmetric, linear, information percolation, Poisson, half-space, and Euclidean-invariant variants) and clarifies how the choice of observation process and neural denoisers shapes efficiency and accuracy. It also explores the interaction between sampling schemes and data architecture, illustrating how to tailor schemes to capture long-range correlations in images and how symmetry considerations guide kernel design. The paper discusses practical aspects such as approximating transition probabilities, learning from samples, preserving problem symmetries, avoiding problematic regions or phase transitions, and incorporating latent variables, offering a roadmap for building scalable, structure-aware diffusion-based samplers. Overall, it provides a versatile, unified lens for designing, analyzing, and applying diffusion and localization-based sampling methods across continuous and discrete settings.
Abstract
Diffusions are a successful technique to sample from high-dimensional distributions. The target distribution can be either explicitly given or learnt from a collection of samples. They implement a diffusion process whose endpoint is a sample from the target distribution. The drift of the diffusion process is typically represented as a neural network. Stochastic localization is a successful technique to prove mixing of Markov Chains and other functional inequalities in high dimension. An algorithmic version of stochastic localization was recently proposed in order to sample from certain statistical mechanics models. This expository article has three objectives: $(i)$~Generalize the algorithmic construction to other stochastic localization processes. This construction is both simple and broadly applicable; $(ii)$~Clarify the connection between diffusions and stochastic localization. This allows to derive several known sampling schemes in a unified fashion; $(iii)$~Describe the insights that follow from this unified viewpoint.
