Generative Modeling with Continuous Flows: Sample Complexity of Flow Matching

Mudit Gaur; Prashant Trivedi; Shuchin Aeron; Amrit Singh Bedi; George K. Atia; Vaneet Aggarwal

Generative Modeling with Continuous Flows: Sample Complexity of Flow Matching

Mudit Gaur, Prashant Trivedi, Shuchin Aeron, Amrit Singh Bedi, George K. Atia, Vaneet Aggarwal

TL;DR

This work analyzes the sample efficiency of flow-matching generative models by recasting them as deterministic ODE flows driven by a velocity field learned via neural networks. It introduces a Gaussian probability path to achieve a tractable, bounded-analysis setting and decomposes the velocity-estimation error into approximation, statistical, and optimization components. The authors prove a first formal sample complexity bound for flow matching without requiring ERM, showing that a fully expressive neural network can achieve $W_2$ distance $\mathcal{O}(\varepsilon)$ with $n = \Omega((W)^{2D-2} d^2 / \varepsilon^4 \log(2/\delta))$ samples, up to a model-approximation error. This work bridges theory and practice for flow-based generative modeling, providing foundations for data-efficient training and fast sampling in continuous-flow frameworks.

Abstract

Flow matching has recently emerged as a promising alternative to diffusion-based generative models, offering faster sampling and simpler training by learning continuous flows governed by ordinary differential equations. Despite growing empirical success, the theoretical understanding of flow matching remains limited, particularly in terms of sample complexity results. In this work, we provide the first analysis of the sample complexity for flow-matching based generative models without assuming access to the empirical risk minimizer (ERM) of the loss function for estimating the velocity field. Under standard assumptions on the loss function for velocity field estimation and boundedness of the data distribution, we show that a sufficiently expressive neural network can learn a velocity field such that with $\mathcal{O}(ε^{-4})$ samples, such that the Wasserstein-2 distance between the learned and the true distribution is less than $\mathcal{O}(ε)$. The key technical idea is to decompose the velocity field estimation error into neural-network approximation error, statistical error due to the finite sample size, and optimization error due to the finite number of optimization steps for estimating the velocity field. Each of these terms are then handled via techniques that may be of independent interest.

Generative Modeling with Continuous Flows: Sample Complexity of Flow Matching

TL;DR

Abstract

Generative Modeling with Continuous Flows: Sample Complexity of Flow Matching

TL;DR

Abstract

Paper Structure

Table of Contents

Key Result

Theorems & Definitions (17)