Learning Straight Flows: Variational Flow Matching for Efficient Generation
Chenrui Ma, Xi Xiao, Tianyang Wang, Xiao Wang, Yanning Shen
TL;DR
This work tackles the curvature of flows learned by Flow Matching caused by independent sampling couplings. It introduces Straight Variational Flow Matching (S-VFM), which conditions a velocity field on a global generation overview via a variational latent code and enforces straight trajectories by minimizing the time derivative $D_t v$ along generation paths. The authors provide a theoretical foundation for straight interpolants, proving marginal preservation and equivalence to vanishing $D_t v$, and demonstrate strong empirical gains in one-step and few-step generation on CIFAR-10 and ImageNet, along with synthetic visualizations. Collectively, S-VFM improves generation fidelity, sampling efficiency, and training stability by resolving the intrinsic conflict between straight trajectory learning and independent couplings, with practical impact on fast, high-quality generative modeling.
Abstract
Flow Matching has limited ability in achieving one-step generation due to its reliance on learned curved trajectories. Previous studies have attempted to address this limitation by either modifying the coupling distribution to prevent interpolant intersections or introducing consistency and mean-velocity modeling to promote straight trajectory learning. However, these approaches often suffer from discrete approximation errors, training instability, and convergence difficulties. To tackle these issues, in the present work, we propose \textbf{S}traight \textbf{V}ariational \textbf{F}low \textbf{M}atching (\textbf{S-VFM}), which integrates a variational latent code representing the ``generation overview'' into the Flow Matching framework. \textbf{S-VFM} explicitly enforces trajectory straightness, ideally producing linear generation paths. The proposed method achieves competitive performance across three challenge benchmarks and demonstrates advantages in both training and inference efficiency compared with existing methods.
