An Overview of Diffusion Models: Applications, Guided Generation, Statistical Rates and Optimization
Minshuo Chen, Song Mei, Jianqing Fan, Mengdi Wang
TL;DR
The paper surveys diffusion models from a continuous-time, score-based lens, connecting unconditional diffusion theory to conditional, guided generation and showcasing applications across vision, audio, RL, and life sciences. It consolidates learning and estimation of score functions, distribution learning, and sampling theory, and extends to conditional diffusion, guidance mechanisms, and diffusion-based black-box optimization. Key contributions include theoretical progress on score estimation under subspace and manifold structures, guidance-strength analyses, and a principled optimization framework that treats constrained rewards as conditional sampling problems. The synthesis clarifies when diffusion models can be trusted for high-dimensional, structured data and how guidance and optimization perspectives open new research directions. It also outlines future directions, including stochastic control links, robustness considerations, and discrete-diffusion variants for discrete data settings.
Abstract
Diffusion models, a powerful and universal generative AI technology, have achieved tremendous success in computer vision, audio, reinforcement learning, and computational biology. In these applications, diffusion models provide flexible high-dimensional data modeling, and act as a sampler for generating new samples under active guidance towards task-desired properties. Despite the significant empirical success, theory of diffusion models is very limited, potentially slowing down principled methodological innovations for further harnessing and improving diffusion models. In this paper, we review emerging applications of diffusion models, understanding their sample generation under various controls. Next, we overview the existing theories of diffusion models, covering their statistical properties and sampling capabilities. We adopt a progressive routine, beginning with unconditional diffusion models and connecting to conditional counterparts. Further, we review a new avenue in high-dimensional structured optimization through conditional diffusion models, where searching for solutions is reformulated as a conditional sampling problem and solved by diffusion models. Lastly, we discuss future directions about diffusion models. The purpose of this paper is to provide a well-rounded theoretical exposure for stimulating forward-looking theories and methods of diffusion models.
