Transport Based Mean Flows for Generative Modeling
Elaheh Akbari, Ping He, Ahmadreza Moradipari, Yikun Bai, Soheil Kolouri
TL;DR
OT-Mean Flow addresses slow inference in flow-based generative modeling by learning a mean velocity aligned with an optimal transport coupling between the source and target distributions, i.e., the OT objective $OT(\mathbf{p},\mathbf{q})$. The method unifies conditional flow matching, minibatch OT flow matching, and mean-flow training under a single objective, enabling one-step generation. It introduces OT acceleration strategies such as Sinkhorn OT and Linear OT to scale training, and demonstrates improved inference accuracy and speed on low-dimensional controls, image generation (MNIST), and 3D point clouds and unpaired image-to-image translation. The results indicate that geometry-aware transport couplings yield straighter, more faithful trajectories than vanilla mean-flow.
Abstract
Flow-matching generative models have emerged as a powerful paradigm for continuous data generation, achieving state-of-the-art results across domains such as images, 3D shapes, and point clouds. Despite their success, these models suffer from slow inference due to the requirement of numerous sequential sampling steps. Recent work has sought to accelerate inference by reducing the number of sampling steps. In particular, Mean Flows offer a one-step generation approach that delivers substantial speedups while retaining strong generative performance. Yet, in many continuous domains, Mean Flows fail to faithfully approximate the behavior of the original multi-step flow-matching process. In this work, we address this limitation by incorporating optimal transport-based sampling strategies into the Mean Flow framework, enabling one-step generators that better preserve the fidelity and diversity of the original multi-step flow process. Experiments on controlled low-dimensional settings and on high-dimensional tasks such as image generation, image-to-image translation, and point cloud generation demonstrate that our approach achieves superior inference accuracy in one-step generative modeling.
