Mitigating Diffusion Model Hallucinations with Dynamic Guidance
Kostas Triaridis, Alexandros Graikos, Aggelina Chatziagapi, Grigorios G. Chrysos, Dimitris Samaras
TL;DR
Dynamic Guidance tackles diffusion-model hallucinations by adaptively sharpening the score function $s_ heta(x_t,t)$ along directions that induce artifacts during sampling, while preserving benign semantic interpolations. At each timestep, it identifies the most likely class $y^*= ext{argmax}_y\log p(y|x_t)$ and applies guided denoising with $\\hat{\epsilon}=\epsilon_\theta(x_t,t) - \lambda\sqrt{1-\bar{\alpha}_t}\nabla_{x_t}\log p(y^*|x_t)$, enabling generation-time control without fixed early-conditioning. The method is evaluated from toy 2D Gaussians to real-world ImageNet-scale generation, showing substantial hallucination reductions (often >50%) across settings and improved proxy metrics (precision, Inception Score) over traditional guidance and post-hoc filtering. This work provides a principled, efficient approach to reduce hallucinations in diffusion sampling, with practical impact for more reliable and diverse image generation.
Abstract
Diffusion models, despite their impressive demos, often produce hallucinatory samples with structural inconsistencies that lie outside of the support of the true data distribution. Such hallucinations can be attributed to excessive smoothing between modes of the data distribution. However, semantic interpolations are often desirable and can lead to generation diversity, thus we believe a more nuanced solution is required. In this work, we introduce Dynamic Guidance, which tackles this issue. Dynamic Guidance mitigates hallucinations by selectively sharpening the score function only along the pre-determined directions known to cause artifacts, while preserving valid semantic variations. To our knowledge, this is the first approach that addresses hallucinations at generation time rather than through post-hoc filtering. Dynamic Guidance substantially reduces hallucinations on both controlled and natural image datasets, significantly outperforming baselines.
