ReDi: Rectified Discrete Flow
Jaehoon Yoo, Wonjung Kim, Seunghoon Hong
TL;DR
ReDi targets slow sampling in Discrete Flow-based Models by addressing the factorization error quantified via the conditional total correlation $TC_ ext{π}(X_s|X_t)$. It iteratively rectifies the coupling using a learned conditional model $p_\theta(X_1|X_0)$ to define a new coupling $π_{k+1}$, with a guarantee that $TC_{π_{k+1}}(X_1|X_0) \le TC_{π_k}(X_1|X_0)$. Empirically, ReDi reduces $TC$ for image and text generation, matching or surpassing distillation baselines in few-step generation, while enabling strong one-step generation via rectified couplings; a perturbed-rectification variant improves robustness in high-dimensional data. The method is simple, memory-efficient, and broadly applicable to DFMs, offering a practical path to faster discrete data synthesis without intricate teacher-student training.
Abstract
Discrete Flow-based Models (DFMs) are powerful generative models for high-quality discrete data but typically suffer from slow sampling speeds due to their reliance on iterative decoding processes. This reliance on a multi-step process originates from the factorization approximation of DFMs, which is necessary for handling high-dimensional data. In this paper, we analyze the factorization approximation error using Conditional Total Correlation (TC), and reveal its dependence on the coupling. To address the challenge of efficient few-step generation, we propose Rectified Discrete Flow (ReDi), a novel iterative method that reduces the underlying factorization error (measured as Conditional TC) by rectifying the coupling between source and target distributions. We theoretically prove that each ReDi step guarantees a monotonic decreasing Conditional TC, ensuring its convergence. Empirically, ReDi significantly reduces Conditional TC and enables few-step generation. Moreover, we demonstrate that the rectified couplings are well-suited for training efficient one-step models on image generation. ReDi offers a simple and theoretically grounded approach for tackling the few-step challenge, providing a new perspective on efficient discrete data synthesis. Code is available at https://github.com/Ugness/ReDi_discrete.
