Analyzing and Mitigating Model Collapse in Rectified Flow Models
Huminhao Zhu, Fangyikang Wang, Tianyu Ding, Qing Qu, Zhihui Zhu
TL;DR
The paper addresses MC in diffusion/flow models trained with self-generated data, focusing on Rectified Flow and its Reflow procedure. It provides a theoretical MC analysis by studying a linear Denoising Autoencoder subjected to recursive self-training, linking collapse mechanisms to diffusion/Rectified Flow, and proves that introducing real data can prevent collapse. The authors then propose Real-data Augmented Reflow (RA Reflow) and online variants (ORA Reflow, ORAS Reflow) that blend real reverse noise–image pairs with synthetic data to straighten flows while preserving stability. Empirical results on Gaussian tasks and standard image benchmarks (CIFAR-10, CelebA-HQ) show that RA/ORA/ORAS Reflow mitigates MC and achieves high-quality sampling with fewer function evaluations. The work advances understanding of MC in simulation-free generative models and offers practical, data-efficient strategies to maintain sampling efficiency in Rectified Flow.
Abstract
Training with synthetic data is becoming increasingly inevitable as synthetic content proliferates across the web, driven by the remarkable performance of recent deep generative models. This reliance on synthetic data can also be intentional, as seen in Rectified Flow models, whose Reflow method iteratively uses self-generated data to straighten the flow and improve sampling efficiency. However, recent studies have shown that repeatedly training on self-generated samples can lead to model collapse (MC), where performance degrades over time. Despite this, most recent work on MC either focuses on empirical observations or analyzes regression problems and maximum likelihood objectives, leaving a rigorous theoretical analysis of reflow methods unexplored. In this paper, we aim to fill this gap by providing both theoretical analysis and practical solutions for addressing MC in diffusion/flow models. We begin by studying Denoising Autoencoders and prove performance degradation when DAEs are iteratively trained on their own outputs. To the best of our knowledge, we are the first to rigorously analyze model collapse in DAEs and, by extension, in diffusion models and Rectified Flow. Our analysis and experiments demonstrate that rectified flow also suffers from MC, leading to potential performance degradation in each reflow step. Additionally, we prove that incorporating real data can prevent MC during recursive DAE training, supporting the recent trend of using real data as an effective approach for mitigating MC. Building on these insights, we propose a novel Real-data Augmented Reflow and a series of improved variants, which seamlessly integrate real data into Reflow training by leveraging reverse flow. Empirical evaluations on standard image benchmarks confirm that RA Reflow effectively mitigates model collapse, preserving high-quality sample generation even with fewer sampling steps.
