Long-time accuracy of ensemble Kalman filters for chaotic and machine-learned dynamical systems
Daniel Sanz-Alonso, Nathan Waniorek
TL;DR
This work establishes rigorous long-time accuracy guarantees for ensemble Kalman filters (EnKF) applied to partially observed, high-dimensional, chaotic dynamical systems, including Navier–Stokes and Lorenz models. It proves two main results: (i) long-time accuracy of the square-root EnKF with inflation for the true dynamics, and (ii) long-time accuracy when the dynamics are replaced by machine-learned surrogate models, provided the surrogates have controlled short-term error in the unobserved components. The proofs leverage a mean-field Gaussian projected filter as an intermediate step and are complemented by mean-field surrogate analyses, yielding error bounds that scale with the observation noise level ε (and surrogate error δ) rather than the full state dimension. Numerical experiments on Lorenz-96 corroborate the theory, showing accurate long-time state estimation even with surrogates that are only short-term accurate, thereby supporting the practical use of ML surrogates in data assimilation for geophysical systems.
Abstract
Filtering is concerned with online estimation of the state of a dynamical system from partial and noisy observations. In applications where the state is high dimensional, ensemble Kalman filters are often the method of choice. This paper establishes long-time accuracy of ensemble Kalman filters. We introduce conditions on the dynamics and the observations under which the estimation error remains small in the long-time horizon. Our theory covers a wide class of partially-observed chaotic dynamical systems, which includes the Navier-Stokes equations and Lorenz models. In addition, we prove long-time accuracy of ensemble Kalman filters with surrogate dynamics, thus validating the use of machine-learned forecast models in ensemble data assimilation.
