Fast Sampling via Discrete Non-Markov Diffusion Models with Predetermined Transition Time
Zixiang Chen, Huizhuo Yuan, Yongqian Li, Yiwen Kou, Junkai Zhang, Quanquan Gu
TL;DR
Problem: slow sampling in discrete diffusion models limits practical use for text generation and machine translation. Approach: discrete non-Markov diffusion model with a predetermined transition-time set that enables training-free, deterministic reverse sampling, plus a continuous-time infinite-step variant DNDM-C. Contributions: (i) a theoretically grounded DNDM that preserves q(x_t) and q(x0|x_t) and reduces neural evaluations, (ii) an accelerated reverse sampler with NFE ~ |T|, (iii) the infinite-step sampling insight bridging discrete and continuous-time diffusion, and (iv) extensive language generation experiments showing large speedups with competitive quality. Impact: enables fast discrete diffusion for NLP tasks and provides a unified framework for discrete-transition-time sampling; future work extends to audio/image.
Abstract
Discrete diffusion models have emerged as powerful tools for high-quality data generation. Despite their success in discrete spaces, such as text generation tasks, the acceleration of discrete diffusion models remains under-explored. In this paper, we propose discrete non-Markov diffusion models (DNDM), which naturally induce the predetermined transition time set. This enables a training-free sampling algorithm that significantly reduces the number of function evaluations (i.e., calls to the neural network), making the sampling process much faster. Furthermore, we study the transition from finite to infinite step sampling, offering new insights into bridging the gap between discrete and continuous-time processes for discrete diffusion models. Extensive experiments on natural language generation and machine translation tasks demonstrate the superior performance of our method in terms of both generation speed and sample quality compared to existing methods for discrete diffusion models.
