Flow Matching: Markov Kernels, Stochastic Processes and Transport Plans
Christian Wald, Gabriele Steidl
TL;DR
This work surveys flow matching from a rigorous mathematical standpoint, unifying three perspectives—couplings, Markov kernels, and stochastic processes—for constructing velocity fields that transport from a latent to a data distribution within Wasserstein geometry. It shows how to learn these velocity fields through flow-matching losses and demonstrates applications to Bayesian inverse problems via conditional Wasserstein distances and partial constraints on couplings. The paper also situates flow matching relative to continuous normalizing flows and score-based diffusion, highlighting practical algorithmic schemes and numerical demonstrations. The resulting framework offers a principled route to sampling and posterior inference by evolving probability measures along AC curves rather than learning a single transport map. Overall, it provides both theoretical foundations and practical tools for scalable, geometry-aware generative modeling and inverse problems.
Abstract
Among generative neural models, flow matching techniques stand out for their simple applicability and good scaling properties. Here, velocity fields of curves connecting a simple latent and a target distribution are learned. Then the corresponding ordinary differential equation can be used to sample from a target distribution, starting in samples from the latent one. This paper reviews from a mathematical point of view different techniques to learn the velocity fields of absolutely continuous curves in the Wasserstein geometry. We show how the velocity fields can be characterized and learned via i) transport plans (couplings) between latent and target distributions, ii) Markov kernels and iii) stochastic processes, where the latter two include the coupling approach, but are in general broader. Besides this main goal, we show how flow matching can be used for solving Bayesian inverse problems, where the definition of conditional Wasserstein distances plays a central role. Finally, we briefly address continuous normalizing flows and score matching techniques, which approach the learning of velocity fields of curves from other directions.
