Variational Control for Guidance in Diffusion Models
Kushagra Pandey, Farrin Marouf Sofian, Felix Draxler, Theofanis Karaletsos, Stephan Mandt
TL;DR
This work reframes guidance in pretrained diffusion models as a variational control problem, introducing Diffusion Trajectory Matching (DTM) to steer diffusion trajectories toward a terminal constraint without retraining. By formulating a terminal cost and a transient KL divergence, and implementing greedy, stepwise optimization, the framework unifies existing guidance methods and enables non-linear control via Non-linear Diffusion Trajectory Matching (NDTM). For DDIM samplers, NDTM yields tractable bounds that balance deviation from unguided dynamics with task-specific terminal objectives, achieving state-of-the-art results on challenging inverse problems and effective style guidance with latent-space diffusion models. The approach offers a flexible, training-free mechanism to adapt pretrained diffusion priors to diverse tasks, with potential for deeper theoretical analysis and faster sampling strategies in future work.
Abstract
Diffusion models exhibit excellent sample quality, but existing guidance methods often require additional model training or are limited to specific tasks. We revisit guidance in diffusion models from the perspective of variational inference and control, introducing Diffusion Trajectory Matching (DTM) that enables guiding pretrained diffusion trajectories to satisfy a terminal cost. DTM unifies a broad class of guidance methods and enables novel instantiations. We introduce a new method within this framework that achieves state-of-the-art results on several linear, non-linear, and blind inverse problems without requiring additional model training or specificity to pixel or latent space diffusion models. Our code will be available at https://github.com/czi-ai/oc-guidance
