Adaptive Diffusion Guidance via Stochastic Optimal Control

Iskander Azangulov; Peter Potaptchik; Qinyu Li; Eddie Aamari; George Deligiannidis; Judith Rousseau

Adaptive Diffusion Guidance via Stochastic Optimal Control

Iskander Azangulov, Peter Potaptchik, Qinyu Li, Eddie Aamari, George Deligiannidis, Judith Rousseau

TL;DR

This work addresses the theoretical gaps in diffusion-model guidance by formalizing how guidance strength $w$ affects classifier confidence and by proving that positive guidance keeps samples within the conditional data manifold. Building on these insights, it introduces a stochastic optimal control (SOC) framework that dynamically selects $w_t(x,c)$ during sampling, supported by a gradient-based learning algorithm that uses Girsanov reweighting to handle trajectory-distribution changes. The SOC objective balances improving class alignment with staying close to unguided trajectories, yielding a learnable policy $w_\theta$ that adapts across time, state, and conditioning class. Empirical results on toy settings demonstrate improved alignment and sample quality for learned schedules, while the authors acknowledge gaps in applying the method to real-world images and outline directions for robustification and scalable implementation.

Abstract

Guidance is a cornerstone of modern diffusion models, playing a pivotal role in conditional generation and enhancing the quality of unconditional samples. However, current approaches to guidance scheduling--determining the appropriate guidance weight--are largely heuristic and lack a solid theoretical foundation. This work addresses these limitations on two fronts. First, we provide a theoretical formalization that precisely characterizes the relationship between guidance strength and classifier confidence. Second, building on this insight, we introduce a stochastic optimal control framework that casts guidance scheduling as an adaptive optimization problem. In this formulation, guidance strength is not fixed but dynamically selected based on time, the current sample, and the conditioning class, either independently or in combination. By solving the resulting control problem, we establish a principled foundation for more effective guidance in diffusion models.

Adaptive Diffusion Guidance via Stochastic Optimal Control

TL;DR

This work addresses the theoretical gaps in diffusion-model guidance by formalizing how guidance strength

affects classifier confidence and by proving that positive guidance keeps samples within the conditional data manifold. Building on these insights, it introduces a stochastic optimal control (SOC) framework that dynamically selects

during sampling, supported by a gradient-based learning algorithm that uses Girsanov reweighting to handle trajectory-distribution changes. The SOC objective balances improving class alignment with staying close to unguided trajectories, yielding a learnable policy

that adapts across time, state, and conditioning class. Empirical results on toy settings demonstrate improved alignment and sample quality for learned schedules, while the authors acknowledge gaps in applying the method to real-world images and outline directions for robustification and scalable implementation.

Adaptive Diffusion Guidance via Stochastic Optimal Control

TL;DR

Abstract

Adaptive Diffusion Guidance via Stochastic Optimal Control

TL;DR

Abstract

Paper Structure

Table of Contents

Key Result

Figures (2)

Theorems & Definitions (12)