Statistical Properties of Rectified Flow
Gonzalo Mena, Arun Kumar Kuchibhotla, Larry Wasserman
TL;DR
This work develops a rigorous statistical theory for rectified flow, a velocity-field–driven method to construct transport maps between distributions. It introduces multiple representations of the velocity under independence coupling and derives four estimation approaches (density-based, regression-based, substitutions, and semiparametric), along with smoothing variants, to estimate the velocity and the resulting rectified map. The authors establish existence, regularity, and convergence rates for the rectified flow in both unbounded and bounded domains, including central limit theorems and perturbation-based linearization results that quantify how estimation error propagates through the ODE. Through explicit examples, E2E analysis, and numerical experiments, the paper demonstrates that rectified flow can achieve faster rates than standard nonparametric regression or density estimation and provides practical, scalable tools for transport-map estimation without solving large variational problems. Overall, the work bridges statistical regression tools and dynamical transport, offering theoretically grounded, implementable methods for approximate transport maps with quantifiable uncertainty.
Abstract
Rectified flow (Liu et al., 2022; Liu, 2022; Wu et al., 2023) is a method for defining a transport map between two distributions, and enjoys popularity in machine learning, although theoretical results supporting the validity of these methods are scant. The rectified flow can be regarded as an approximation to optimal transport, but in contrast to other transport methods that require optimization over a function space, computing the rectified flow only requires standard statistical tools such as regression or density estimation, which we leverage to develop empirical versions of transport maps. We study some structural properties of the rectified flow, including existence, uniqueness, and regularity, as well as the related statistical properties, such as rates of convergence and central limit theorems, for some selected estimators. To do so, we analyze the bounded and unbounded cases separately as each presents unique challenges. In both cases, we are able to establish convergence at faster rates than those for the usual nonparametric regression and density estimation.
