The Generation Phases of Flow Matching: a Denoising Perspective
Anne Gagneux, Ségolène Martin, Rémi Gribonval, Mathurin Massias
TL;DR
Flow matching can be understood through a denoising lens, enabling principled comparisons with denoisers at every noise level. The authors construct a denoising toolkit that maps between denoisers and velocities, and they design drift and noise perturbations to probe generation dynamics. They show that different denoising losses and parametrizations, although theoretically equivalent under perfect optimization, produce distinct generation and denoising behaviors and reveal temporally distinct phases: early drift-sensitive and late noise-sensitive. The findings highlight intermediate-time importance for generation quality and offer a principled path to customize generation dynamics via controlled training and perturbations.
Abstract
Flow matching has achieved remarkable success, yet the factors influencing the quality of its generation process remain poorly understood. In this work, we adopt a denoising perspective and design a framework to empirically probe the generation process. Laying down the formal connections between flow matching models and denoisers, we provide a common ground to compare their performances on generation and denoising. This enables the design of principled and controlled perturbations to influence sample generation: noise and drift. This leads to new insights on the distinct dynamical phases of the generative process, enabling us to precisely characterize at which stage of the generative process denoisers succeed or fail and why this matters.
