Implicit Dynamical Flow Fusion (IDFF) for Generative Modeling
Mohammad R. Rezaei, Milos R. Popovic, Milad Lankarany, Rahul G. Krishnan
TL;DR
IDFF presents a momentum-augmented, higher-order dynamical flow fusion framework that upgrades conditional flow matching by injecting a learnable momentum term directly into the sampling vector field and by learning in the data sample space. This design eliminates the need for computationally expensive OT steps, enables larger integration steps, and achieves substantial reductions in the number of function evaluations while preserving sample fidelity across both image and time-series domains. The approach extends to higher-order momentum and supports time-series as well as static data, with a training objective that jointly learns the denoised target and multiple orders of the log-density derivatives. Empirically, IDFF attains competitive or superior performance to CFMs and diffusion-based methods on CIFAR-10, CelebA, MD simulations, and SST forecasting, often with NFEs as low as 5 and significantly faster sampling, highlighting its practical impact for fast, flexible generative modeling. Limitations include increased computational cost for higher-order derivatives and reliance on backbone architectures; future work aims to optimize higher-order terms and explore broader applications such as audio and biological time-series modeling.
Abstract
Conditional Flow Matching (CFM) models can generate high-quality samples from a non-informative prior, but they can be slow, often needing hundreds of network evaluations (NFE). To address this, we propose Implicit Dynamical Flow Fusion (IDFF); IDFF learns a new vector field with an additional momentum term that enables taking longer steps during sample generation while maintaining the fidelity of the generated distribution. Consequently, IDFFs reduce the NFEs by a factor of ten (relative to CFMs) without sacrificing sample quality, enabling rapid sampling and efficient handling of image and time-series data generation tasks. We evaluate IDFF on standard benchmarks such as CIFAR-10 and CelebA for image generation, where we achieve likelihood and quality performance comparable to CFMs and diffusion-based models with fewer NFEs. IDFF also shows superior performance on time-series datasets modeling, including molecular simulation and sea surface temperature (SST) datasets, highlighting its versatility and effectiveness across different domains.\href{https://github.com/MrRezaeiUofT/IDFF}{Github Repository}
