CurveFlow: Curvature-Guided Flow Matching for Image Generation
Yan Luo, Drake Du, Hao Huang, Yi Fang, Mengyu Wang
TL;DR
Rectified Flow's zero-curvature linear trajectories can misalign image generation with complex textual prompts. CurveFlow introduces curvature-guided, non-linear trajectories parameterized by $z_t = a_\phi(t) x_0 + b_\psi(t) \epsilon$ and a robust curvature regularization to stabilize learning. Empirical results on MS COCO 2014/2017 show state-of-the-art text-to-image performance with improved semantic metrics (BLEU, METEOR, ROUGE, CLAIR) and strong image quality (FID), confirming that curvature-aware flow enhances instruction compliance. The approach offers an efficient, geometry-aware alternative to diffusion, enabling more faithful and detailed image synthesis guided by complex text.
Abstract
Existing rectified flow models are based on linear trajectories between data and noise distributions. This linearity enforces zero curvature, which can inadvertently force the image generation process through low-probability regions of the data manifold. A key question remains underexplored: how does the curvature of these trajectories correlate with the semantic alignment between generated images and their corresponding captions, i.e., instructional compliance? To address this, we introduce CurveFlow, a novel flow matching framework designed to learn smooth, non-linear trajectories by directly incorporating curvature guidance into the flow path. Our method features a robust curvature regularization technique that penalizes abrupt changes in the trajectory's intrinsic dynamics.Extensive experiments on MS COCO 2014 and 2017 demonstrate that CurveFlow achieves state-of-the-art performance in text-to-image generation, significantly outperforming both standard rectified flow variants and other non-linear baselines like Rectified Diffusion. The improvements are especially evident in semantic consistency metrics such as BLEU, METEOR, ROUGE, and CLAIR. This confirms that our curvature-aware modeling substantially enhances the model's ability to faithfully follow complex instructions while simultaneously maintaining high image quality. The code is made publicly available at https://github.com/Harvard-AI-and-Robotics-Lab/CurveFlow.
