How Discrete and Continuous Diffusion Meet: Comprehensive Analysis of Discrete Diffusion Models via a Stochastic Integral Framework
Yinuo Ren, Haoxuan Chen, Grant M. Rotskoff, Lexing Ying
TL;DR
The paper addresses the limited theoretical analysis of discrete diffusion models by introducing a Lévy-type stochastic-integral framework built on Poisson random measures with evolving intensity. This approach yields a unified path-wise formulation for both forward and backward discrete diffusions and enables a change-of-measure analysis that links inference losses to KL divergence. It derives the first KL-based error bound for the τ-leaping scheme and provides a comprehensive comparison with the uniformization method, including explicit step bounds and complexity, under a set of practical assumptions. The framework relaxes several prior stringent conditions and establishes a principled basis for designing efficient, accurate discrete diffusion algorithms with rigorous error control across real-world applications. Overall, the work bridges discrete and continuous diffusion theories, offering tools to analyze and improve discrete diffusion models in diverse domains.
Abstract
Discrete diffusion models have gained increasing attention for their ability to model complex distributions with tractable sampling and inference. However, the error analysis for discrete diffusion models remains less well-understood. In this work, we propose a comprehensive framework for the error analysis of discrete diffusion models based on Lévy-type stochastic integrals. By generalizing the Poisson random measure to that with a time-independent and state-dependent intensity, we rigorously establish a stochastic integral formulation of discrete diffusion models and provide the corresponding change of measure theorems that are intriguingly analogous to Itô integrals and Girsanov's theorem for their continuous counterparts. Our framework unifies and strengthens the current theoretical results on discrete diffusion models and obtains the first error bound for the $τ$-leaping scheme in KL divergence. With error sources clearly identified, our analysis gives new insight into the mathematical properties of discrete diffusion models and offers guidance for the design of efficient and accurate algorithms for real-world discrete diffusion model applications.
