Feedback Guidance of Diffusion Models
Felix Koulischer, Florian Handke, Johannes Deleu, Thomas Demeester, Luca Ambrogioni
TL;DR
Classifier-Free Guidance (CFG) improves diffusion-based conditional generation but can hurt diversity when applied uniformly. This work introduces FeedBack Guidance (FBG), a state- and time-dependent guidance mechanism derived from an additive error model and posterior tracking, enabling dynamic adjustment of guidance during inference. Empirical results show FBG outperforms CFG on ImageNet-512×512 and remains competitive with Limited Interval Guidance, while automatically increasing guidance for complex Text-To-Image prompts and remaining compatible with CFG and LIG. The approach provides a solid theoretical framework for adaptive guidance and demonstrates practical benefits across image and text-to-image generation tasks.
Abstract
While Classifier-Free Guidance (CFG) has become standard for improving sample fidelity in conditional diffusion models, it can harm diversity and induce memorization by applying constant guidance regardless of whether a particular sample needs correction. We propose FeedBack Guidance (FBG), which uses a state-dependent coefficient to self-regulate guidance amounts based on need. Our approach is derived from first principles by assuming the learned conditional distribution is linearly corrupted by the unconditional distribution, contrasting with CFG's implicit multiplicative assumption. Our scheme relies on feedback of its own predictions about the conditional signal informativeness to adapt guidance dynamically during inference, challenging the view of guidance as a fixed hyperparameter. The approach is benchmarked on ImageNet512x512, where it significantly outperforms Classifier-Free Guidance and is competitive to Limited Interval Guidance (LIG) while benefitting from a strong mathematical framework. On Text-To-Image generation, we demonstrate that, as anticipated, our approach automatically applies higher guidance scales for complex prompts than for simpler ones and that it can be easily combined with existing guidance schemes such as CFG or LIG.
