Efficient Approximate Posterior Sampling with Annealed Langevin Monte Carlo
Advait Parulekar, Litu Rout, Karthikeyan Shanmugam, Sanjay Shakkottai
TL;DR
This paper tackles the challenge of posterior sampling for score-based priors under a measurement model by introducing Annealed Langevin Monte Carlo (ALMC). Instead of chasing exact KL-posterior sampling, it constructs a time-varying path of posteriors μ_t for a noised prior and uses warm-started Langevin dynamics followed by annealing to track this path. The authors prove polynomial-time guarantees: early stopping yields simultaneous KL proximity to the annealed posterior and FI proximity to the true posterior, under minimal assumptions (sub-Gaussian priors, Lipschitz scores, convex smooth likelihood). They further show that combining KL and FI guarantees avoids mode-collapse in multimodal settings, providing both global and local correctness for approximate posterior sampling. The results offer a principled, tractable framework for posterior inference with score-based models and suggest avenues for extensions to other posterior-sampling paradigms.
Abstract
We study the problem of posterior sampling in the context of score based generative models. We have a trained score network for a prior $p(x)$, a measurement model $p(y|x)$, and are tasked with sampling from the posterior $p(x|y)$. Prior work has shown this to be intractable in KL (in the worst case) under well-accepted computational hardness assumptions. Despite this, popular algorithms for tasks such as image super-resolution, stylization, and reconstruction enjoy empirical success. Rather than establishing distributional assumptions or restricted settings under which exact posterior sampling is tractable, we view this as a more general "tilting" problem of biasing a distribution towards a measurement. Under minimal assumptions, we show that one can tractably sample from a distribution that is simultaneously close to the posterior of a noised prior in KL divergence and the true posterior in Fisher divergence. Intuitively, this combination ensures that the resulting sample is consistent with both the measurement and the prior. To the best of our knowledge these are the first formal results for (approximate) posterior sampling in polynomial time.
