Language Rectified Flow: Advancing Diffusion Language Generation with Probabilistic Flows

Shujian Zhang; Lemeng Wu; Chengyue Gong; Xingchao Liu

Language Rectified Flow: Advancing Diffusion Language Generation with Probabilistic Flows

Shujian Zhang, Lemeng Wu, Chengyue Gong, Xingchao Liu

TL;DR

This paper introduces Language Rectified Flow (LF), an ordinary differential equation-based transport framework that moves text representations from a source to a target distribution to enable fast, controllable language generation and domain transfer. LF uses a VAE to embed text in a continuous latent space and learns a neural velocity field guiding straight-line transport, trained with a lexicographic constrained optimization that balances reconstruction and transport quality. Empirically, LF achieves substantial speedups (up to tens of times faster than diffusion-based methods) and improves performance on fine-grained control tasks (POS, length, infill) and text editing on Yelp and Amazon datasets, with extensive ablations validating design choices. The approach generalizes across NLP tasks, providing a unified, efficient alternative to diffusion models for controllable text generation.

Abstract

Recent works have demonstrated success in controlling sentence attributes ($e.g.$, sentiment) and structure ($e.g.$, syntactic structure) based on the diffusion language model. A key component that drives theimpressive performance for generating high-quality samples from noise is iteratively denoise for thousands of steps. While beneficial, the complexity of starting from the noise and the learning steps has limited its implementation to many NLP real-world applications. This paper proposes Language Rectified Flow ({\ours}). Our method is based on the reformulation of the standard probabilistic flow models. Language rectified flow learns (neural) ordinary differential equation models to transport between the source distribution and the target distribution, hence providing a unified and effective solution to generative modeling and domain transfer. From the source distribution, our language rectified flow yields fast simulation and effectively decreases the inference time. Experiments on three challenging fine-grained control tasks and multiple high-quality text editing show that our method consistently outperforms its baselines. Extensive experiments and ablation studies demonstrate that our method can be general, effective, and beneficial for many NLP tasks.

Language Rectified Flow: Advancing Diffusion Language Generation with Probabilistic Flows

TL;DR

Abstract

Recent works have demonstrated success in controlling sentence attributes (

, sentiment) and structure (

, syntactic structure) based on the diffusion language model. A key component that drives theimpressive performance for generating high-quality samples from noise is iteratively denoise for thousands of steps. While beneficial, the complexity of starting from the noise and the learning steps has limited its implementation to many NLP real-world applications. This paper proposes Language Rectified Flow ({\ours}). Our method is based on the reformulation of the standard probabilistic flow models. Language rectified flow learns (neural) ordinary differential equation models to transport between the source distribution and the target distribution, hence providing a unified and effective solution to generative modeling and domain transfer. From the source distribution, our language rectified flow yields fast simulation and effectively decreases the inference time. Experiments on three challenging fine-grained control tasks and multiple high-quality text editing show that our method consistently outperforms its baselines. Extensive experiments and ablation studies demonstrate that our method can be general, effective, and beneficial for many NLP tasks.

Paper Structure (39 sections, 7 equations, 2 figures, 12 tables)

This paper contains 39 sections, 7 equations, 2 figures, 12 tables.

Introduction
Method
Encoding and Latent Space
Probability Flows
Efficiency with Language Rectified Flows
Constrained Optimization
Trade-off is a Problem.
Our Equation.
The Proposed Algorithm.
Experimental Settings
Control Tasks and Evaluation Metrics
Dataset.
Setting and Metrics.
Baselines.
Text Editing Task and Evaluation Metrics
...and 24 more sections

Figures (2)

Figure 1: Overview of language rectified flow. Some notations are labeled along with corresponding components.
Figure 2: Results of the study on different generation steps for ours on the infill text generation.

Language Rectified Flow: Advancing Diffusion Language Generation with Probabilistic Flows

TL;DR

Abstract

Language Rectified Flow: Advancing Diffusion Language Generation with Probabilistic Flows

Authors

TL;DR

Abstract

Table of Contents

Figures (2)