Accelerated Sequential Flow Matching: A Bayesian Filtering Perspective
Yinan Huang, Hans Hao-Hsun Hsu, Junran Wang, Bo Dai, Pan Li
TL;DR
This work reframes sequential inference in streaming stochastic systems as a Bayesian filtering problem and introduces Sequential Flow Matching, which learns a probability flow transporting the predictive distribution from $p(x_{t-1}|z_{\le t-1})$ to $p(x_t|z_{\le t})$. By leveraging the previous posterior as a principled warm start, the method reduces sampling error compared to restarting from a non-informative base and achieves performance competitive with full-step diffusion using only one or a few sampling steps. Across forecasting, planning/control, and state estimation tasks, it demonstrates favorable fidelity-latency tradeoffs, enabling real-time deployment of flow-based models. The approach provides a principled perspective on efficient sequential inference, with practical gains in latency and robustness, though it relies on a pretraining-generation-finetuning pipeline and could benefit from simulation-free training in future work.
Abstract
Sequential prediction from streaming observations is a fundamental problem in stochastic dynamical systems, where inherent uncertainty often leads to multiple plausible futures. While diffusion and flow-matching models are capable of modeling complex, multi-modal trajectories, their deployment in real-time streaming environments typically relies on repeated sampling from a non-informative initial distribution, incurring substantial inference latency and potential system backlogs. In this work, we introduce Sequential Flow Matching, a principled framework grounded in Bayesian filtering. By treating streaming inference as learning a probability flow that transports the predictive distribution from one time step to the next, our approach naturally aligns with the recursive structure of Bayesian belief updates. We provide theoretical justification that initializing generation from the previous posterior offers a principled warm start that can accelerate sampling compared to naïve re-sampling. Across a wide range of forecasting, decision-making and state estimation tasks, our method achieves performance competitive with full-step diffusion while requiring only one or very few sampling steps, therefore with faster sampling. It suggests that framing sequential inference via Bayesian filtering provides a new and principled perspective towards efficient real-time deployment of flow-based models.
