Generalization and Memorization in Rectified Flow

Mingxing Rao; Daniel Moyer

Generalization and Memorization in Rectified Flow

Mingxing Rao, Daniel Moyer

Abstract

Generative models based on the Flow Matching objective, particularly Rectified Flow, have emerged as a dominant paradigm for efficient, high-fidelity image synthesis. However, while existing research heavily prioritizes generation quality and architectural scaling, the underlying dynamics of how RF models memorize training data remain largely underexplored. In this paper, we systematically investigate the memorization behaviors of RF through the test statistics of Membership Inference Attacks (MIA). We progressively formulate three test statistics, culminating in a complexity-calibrated metric ($T_\text{mc\_cal}$) that successfully decouples intrinsic image spatial complexity from genuine memorization signals. This calibration yields a significant performance surge -- boosting attack AUC by up to 15\% and the privacy-critical TPR@1\%FPR metric by up to 45\% -- establishing the first non-trivial MIA specifically tailored for RF. Leveraging these refined metrics, we uncover a distinct temporal pattern: under standard uniform temporal training, a model's susceptibility to MIA strictly peaks at the integration midpoint, a phenomenon we justify via the network's forced deviation from linear approximations. Finally, we demonstrate that substituting uniform timestep sampling with a Symmetric Exponential (U-shaped) distribution effectively minimizes exposure to vulnerable intermediate timesteps. Extensive evaluations across three datasets confirm that this temporal regularization suppresses memorization while preserving generative fidelity.

Generalization and Memorization in Rectified Flow

Abstract

) that successfully decouples intrinsic image spatial complexity from genuine memorization signals. This calibration yields a significant performance surge -- boosting attack AUC by up to 15\% and the privacy-critical TPR@1\%FPR metric by up to 45\% -- establishing the first non-trivial MIA specifically tailored for RF. Leveraging these refined metrics, we uncover a distinct temporal pattern: under standard uniform temporal training, a model's susceptibility to MIA strictly peaks at the integration midpoint, a phenomenon we justify via the network's forced deviation from linear approximations. Finally, we demonstrate that substituting uniform timestep sampling with a Symmetric Exponential (U-shaped) distribution effectively minimizes exposure to vulnerable intermediate timesteps. Extensive evaluations across three datasets confirm that this temporal regularization suppresses memorization while preserving generative fidelity.

Paper Structure (29 sections, 40 equations, 8 figures, 5 tables)

This paper contains 29 sections, 40 equations, 8 figures, 5 tables.

Introduction
Background and Motivation
Membership Inference Attack
Generalization and Memorization
Differential Privacy
Input Complexity Bias in Generative Models
MIA Test Statistics for Rectified Flow
Deriving MIA Test Criteria
Complexity-Calibrated Test Criteria
Memorization Dynamics across Different Timesteps $t$
The Linearity Gap between $v_\theta$ and $v_\text{linear}$ across $t$
The Orthogonality between $x_t$ and $v$ at $t=0.5$
U-shape Sampling Distribution for Timesteps $t$
Experiments
Setup
...and 14 more sections

Figures (8)

Figure 1: MIA performance (AUC and TPR@1%FPR, at left and right respectively) across integration timesteps $t$ on CIFAR-10 and SVHN (top row and bottom row). Our complexity-calibrated statistic ($T_\text{mc\_cal}$) demonstrates a substantial performance surge over the baselines ($T_\text{naive}$ and $T_\text{mc}$).
Figure 2: Training dynamics (FID and peak AUC) when employing a Symmetric Exponential distribution for timestep sampling. Increasing the concentration parameter $\alpha$ (shown in order of Blue [none], Green, Orange [most]) effectively decelerates and suppresses model memorization while fully preserving generative fidelity.
Figure 3: Top row: The test statistics $T_\text{mc}$ without calibration. Bottom row: The test statistics $T_\text{mc}$ with calibration. The calibrated version provides more separable points between $\mathcal{D}_\text{train}$ and $\mathcal{D}_\text{val}$
Figure 4: The gap between $v_\theta$ and $v_\text{linear}$ for $\mathcal{D}_\text{train}$ and $\mathcal{D}_\text{val}$. Solid lines denote the mean, while the shaded areas indicate $\pm$ standard deviation.
Figure 5: The dynamic with increasing number of Monte Carlo samples on $T_\text{mc}$ and $T_\text{mc\_cal}$
...and 3 more figures

Theorems & Definitions (1)

proof

Generalization and Memorization in Rectified Flow

Abstract

Generalization and Memorization in Rectified Flow

Authors

Abstract

Table of Contents

Figures (8)

Theorems & Definitions (1)