Improved sampling via learned diffusions
Lorenz Richter, Julius Berner
TL;DR
The paper addresses sampling from unnormalized densities using diffusion-based methods and presents a unifying path-space framework that treats the sampling task as time-reversal of controlled diffusions. By introducing divergences between path-space measures, notably the log-variance loss, it overcomes limitations of reverse KL objectives such as mode collapse and high variance. The approach connects Schrödinger bridges, diffusion-based generative modeling, and time-reversed reference-process methods into a single formalism, providing practical, differentiable losses that can be optimized with gradient-based methods. Empirical results on GMM, Funnel, and double-well benchmarks demonstrate that the log-variance loss yields improved stability, mode coverage, and sampling quality across DIS, PIS, and related methods, with strong performance in higher dimensions. Overall, the work offers a principled, scalable route to improved diffusion-based sampling for unnormalized targets and lays groundwork for problem-tailored divergences.
Abstract
Recently, a series of papers proposed deep learning-based approaches to sample from target distributions using controlled diffusion processes, being trained only on the unnormalized target densities without access to samples. Building on previous work, we identify these approaches as special cases of a generalized Schrödinger bridge problem, seeking a stochastic evolution between a given prior distribution and the specified target. We further generalize this framework by introducing a variational formulation based on divergences between path space measures of time-reversed diffusion processes. This abstract perspective leads to practical losses that can be optimized by gradient-based algorithms and includes previous objectives as special cases. At the same time, it allows us to consider divergences other than the reverse Kullback-Leibler divergence that is known to suffer from mode collapse. In particular, we propose the so-called log-variance loss, which exhibits favorable numerical properties and leads to significantly improved performance across all considered approaches.
