$\textit{Jump Your Steps}$: Optimizing Sampling Schedule of Discrete Diffusion Models
Yong-Hyun Park, Chieh-Hsin Lai, Satoshi Hayakawa, Yuhta Takida, Yuki Mitsufuji
TL;DR
Jump Your Steps (JYS) addresses slow sampling in discrete diffusion models by optimizing the discrete sampling schedule to minimize Compounding Decoding Error (CDE) without extra computation. The authors derive a tractable KLUB bound via Girsanov’s theorem for CTMCs and propose a hierarchical, KLUB-guided procedure to search for optimal timesteps, applicable to τ-leaping and $k$-Gillespie samplers. Across synthetic data, CIFAR-10, music, and text, JYS consistently improves sample quality at various NFEs and noise schedules, illustrating its versatility as a general framework for fast, high-quality discrete diffusion sampling. The work also provides two practical techniques to estimate KLUB efficiently and analyzes how JYS adapts to data-dependent token dependencies. Overall, JYS offers a budget-friendly, architecture-agnostic approach to accelerate discrete diffusion generation with quantified error control.
Abstract
Diffusion models have seen notable success in continuous domains, leading to the development of discrete diffusion models (DDMs) for discrete variables. Despite recent advances, DDMs face the challenge of slow sampling speeds. While parallel sampling methods like $τ$-leaping accelerate this process, they introduce $\textit{Compounding Decoding Error}$ (CDE), where discrepancies arise between the true distribution and the approximation from parallel token generation, leading to degraded sample quality. In this work, we present $\textit{Jump Your Steps}$ (JYS), a novel approach that optimizes the allocation of discrete sampling timesteps by minimizing CDE without extra computational cost. More precisely, we derive a practical upper bound on CDE and propose an efficient algorithm for searching for the optimal sampling schedule. Extensive experiments across image, music, and text generation show that JYS significantly improves sampling quality, establishing it as a versatile framework for enhancing DDM performance for fast sampling.
