An optimal control perspective on diffusion-based generative modeling
Julius Berner, Lorenz Richter, Karen Ullrich
TL;DR
The paper connects stochastic optimal control with diffusion-based generative models by deriving an HJB equation for the time evolution of log-densities and showing that the diffusion-model ELBO arises from a control-theoretic verification theorem. It reframes diffusion modeling as a path-space KL minimization and introduces time-reversed diffusion sampling (DIS) to sample from unnormalized densities, bridging diffusion models with sampling in statistics and physics. Through numerical experiments on Gaussian mixtures, Funnel, and double-well densities, DIS demonstrates competitive or superior performance to existing diffusion samplers, while analyses against Schrödinger bridges clarify the methodological distinctions. The work suggests promising future directions, including PDE-based solvers, alternative divergences on path space, and extensions to Schrödinger bridges for more flexible and robust sampling tasks.
Abstract
We establish a connection between stochastic optimal control and generative models based on stochastic differential equations (SDEs), such as recently developed diffusion probabilistic models. In particular, we derive a Hamilton-Jacobi-Bellman equation that governs the evolution of the log-densities of the underlying SDE marginals. This perspective allows to transfer methods from optimal control theory to generative modeling. First, we show that the evidence lower bound is a direct consequence of the well-known verification theorem from control theory. Further, we can formulate diffusion-based generative modeling as a minimization of the Kullback-Leibler divergence between suitable measures in path space. Finally, we develop a novel diffusion-based method for sampling from unnormalized densities -- a problem frequently occurring in statistics and computational sciences. We demonstrate that our time-reversed diffusion sampler (DIS) can outperform other diffusion-based sampling approaches on multiple numerical examples.
