Integrating Score-Based Generative Modeling and Neural ODEs for Accurate Representation of Multiscale Chaotic Dynamics
Giulio Del Felice, Ludovico Theo Giorgini
TL;DR
The paper tackles predictive modeling of multiscale dynamical systems with slow statistically regular behavior and fast chaotic forcing. It introduces a hybrid framework where slow dynamics are governed by a Langevin equation whose drift is constructed from a learned score function via KGMM, ensuring preservation of the invariant measure, while fast chaotic forcing is captured by a Neural ODE trained on delay-embedded residuals, providing accurate short-horizon forecasts. It extends to cyclo-stationary forcing through state-space augmentation of clock variables, enabling autonomous learning in augmented space. Validation on Lorenz-63–driven bistable and tristable systems shows that the approach achieves faithful long-term statistics and reliable early-warning forecasts of rare transitions with lead times approaching the Lyapunov time, outperforming direct slow-variable models or Gaussian-noise closures. This data-driven methodology offers a scalable path to predictive modeling of complex multiscale phenomena where both stationary statistics and transient dynamics are essential, with potential applications in climate tipping points, turbulence, and other nonlinear systems.
Abstract
Multiscale dynamical systems characterized by interacting fast and slow processes are ubiquitous across scientific domains, from climate dynamics to fluid mechanics. Accurate modeling of such systems requires capturing both the long-term statistical properties governed by slow variables and the short-term transient dynamics driven by fast chaotic processes. We present a hybrid data-driven framework that integrates score-based generative modeling with Neural Ordinary Differential Equations (NODEs) to construct reduced-order models (ROMs) capable of reproducing both regimes. The slow dynamics are represented by a Langevin equation whose drift is informed by a score function learned via the K-means Gaussian Mixture Model (KGMM) method, ensuring faithful reproduction of the system's invariant measure. The fast chaotic forcing is modeled by a NODE trained on delay-embedded residuals extracted from observed trajectories, replacing conventional Gaussian noise approximations. We validate this approach on a hierarchy of prototypical metastable systems driven by Lorenz 63 dynamics, including bistable potentials with additive and multiplicative forcing, and tristable non-autonomous systems with cycloperiodic components. Our results demonstrate that the hybrid framework maintains statistical consistency over long time horizons while accurately forecasting rare critical transitions between metastable states with lead times approaching the Lyapunov time of the chaotic driver. This work establishes a principled methodology for combining statistical closure techniques with explicit surrogate models of fast dynamics, offering a pathway toward predictive modeling of complex multiscale phenomena where both long-term statistics and short-term transients are essential.
