Devil is in the Details: Density Guidance for Detail-Aware Generation with Flow Models
Rafał Karczewski, Markus Heinonen, Vikas Garg
TL;DR
This work addresses the disconnect between image likelihood and perceptual detail in diffusion/flow models by showing that higher density samples tend to be smoother and less detailed. It introduces Score Alignment to explain why latent-code scaling (Prior Guidance) affects detail and proves tractable checks for CNFs; it then derives Density Guidance, a principled ODE modification that enforces explicit log-density trajectories during sampling. Extending Density Guidance to stochastic sampling yields Stochastic Density Guidance, which preserves exact log-density control while enabling controlled variation in high-level structure or fine details. Empirically, Density Guidance achieves fine-grained control of image detail with sample quality on par with prior methods, and the approach generalizes to conditional generation and stochastic settings, offering practical tools for density-aware generation.
Abstract
Diffusion models have emerged as a powerful class of generative models, capable of producing high-quality images by mapping noise to a data distribution. However, recent findings suggest that image likelihood does not align with perceptual quality: high-likelihood samples tend to be smooth, while lower-likelihood ones are more detailed. Controlling sample density is thus crucial for balancing realism and detail. In this paper, we analyze an existing technique, Prior Guidance, which scales the latent code to influence image detail. We introduce score alignment, a condition that explains why this method works and show that it can be tractably checked for any continuous normalizing flow model. We then propose Density Guidance, a principled modification of the generative ODE that enables exact log-density control during sampling. Finally, we extend Density Guidance to stochastic sampling, ensuring precise log-density control while allowing controlled variation in structure or fine details. Our experiments demonstrate that these techniques provide fine-grained control over image detail without compromising sample quality. Code is available at https://github.com/Aalto-QuML/density-guidance.
