Probabilistic Forecasting for Dynamical Systems with Missing or Imperfect Data
Siddharth Rout, Eldad Haber, Stéphane Gaudreault
TL;DR
This work tackles forecasting of dynamical systems under missing or noisy data by adopting probabilistic forecasts through stochastic interpolation (SI) and flow matching. It learns a velocity field $v_ heta(q_t,t)$ to transform samples from the initial distribution ${ m pi}_0$ to the future distribution ${ m pi}_T$ via the ODE $dq/dt = v_ heta(q,t)$, enabling generation of diverse future states. To ensure realistic perturbations of the initial state, the method employs a flow-based variational autoencoder to perturb ${ m q}_0$ and produce ensembles that are propagated forward. Experiments on Predator-Prey, MovingMNIST, WeatherBench, and additional datasets show that the resulting ensemble means and variances closely track true distributions, with favorable MSE, MAE, and SSIM metrics, demonstrating a scalable approach for uncertainty quantification in high-dimensional dynamical systems.
Abstract
The modeling of dynamical systems is essential in many fields, but applying machine learning techniques is often challenging due to incomplete or noisy data. This study introduces a variant of stochastic interpolation (SI) for probabilistic forecasting, estimating future states as distributions rather than single-point predictions. We explore its mathematical foundations and demonstrate its effectiveness on various dynamical systems, including the challenging WeatherBench dataset.
