A Kinetic-Energy Perspective of Flow Matching
Ziyun Li, Huancheng Hu, Soon Hoe Lim, Xuyu Li, Fei Gao, Enmao Diao, Zezhen Ding, Michalis Vazirgiannis, Henrik Bostrom
TL;DR
This paper introduces Kinetic Path Energy (KPE), a per-sample diagnostic that integrates the squared velocity along a flow-based generation trajectory to quantify kinetic effort. It shows two robust correspondences: higher KPE aligns with stronger semantic fidelity and with trajectories ending in low-density regions of the data manifold, and it derives a theoretical energy–density relation under posterior dominance. A notable paradox is revealed: the closed-form empirical flow matching (EFM) solution can achieve substantially higher peak energy yet memorize training data, due to a terminal-energy blow-up governed by a 1/(1−t) factor. To address this, the authors propose Kinetic Trajectory Shaping (KTS), a training-free two-phase inference method that boosts early motion and soft-landings late to reduce memorization while improving generation quality, demonstrated on CelebA and ImageNet-256. The work uncovers a Goldilocks principle for kinetic energy in flow-based generation and highlights trajectory-level diagnostics as a powerful lens for understanding and controlling generative dynamics.
Abstract
Flow-based generative models can be viewed through a physics lens: sampling transports a particle from noise to data by integrating a time-varying velocity field, and each sample corresponds to a trajectory with its own dynamical effort. Motivated by classical mechanics, we introduce Kinetic Path Energy (KPE), an action-like, per-sample diagnostic that measures the accumulated kinetic effort along an Ordinary Differential Equation (ODE) trajectory. Empirically, KPE exhibits two robust correspondences: (i) higher KPE predicts stronger semantic fidelity; (ii) high-KPE trajectories terminate on low-density manifold frontiers. We further provide theoretical guarantees linking trajectory energy to data density. Paradoxically, this correlation is non-monotonic. At sufficiently high energy, generation can degenerate into memorization. Leveraging the closed-form of empirical flow matching, we show that extreme energies drive trajectories toward near-copies of training examples. This yields a Goldilocks principle and motivates Kinetic Trajectory Shaping (KTS), a training-free two-phase inference strategy that boosts early motion and enforces a late-time soft landing, reducing memorization and improving generation quality across benchmark tasks.
