Data-driven approximation of Koopman operators and generators: Convergence rates and error bounds
Liam Llamazares-Elias, Samir Llamazares-Elias, Jonas Latz, Stefan Klus
TL;DR
This work develops a data-driven, Monte Carlo Galerkin framework to approximate linear transfer operators and their infinitesimal generators for dynamical systems, unifying EDMD and gEDMD under a common theory. It proves almost-sure convergence of the data-driven operator $\widehat{\mathcal{A}}_{NM}$ to the Galerkin projection $\mathcal{A}_N$ and establishes convergence of eigenvalues and eigenfunctions, with explicit rates and robustness to measurement noise. The analysis extends to joint limits in dictionary size $N$ and data size $M$, without requiring Gram-matrix invertibility, and provides explicit error bounds in terms of Gram and structure matrices. Numerical experiments on deterministic and stochastic systems corroborate the theory, showing predictable convergence rates, spectral recovery, and varying robustness to noise across basis choices, thereby validating a versatile, scalable framework that subsumes EDMD and gEDMD.
Abstract
Global information about dynamical systems can be extracted by analysing associated infinite-dimensional transfer operators, such as Perron-Frobenius and Koopman operators as well as their infinitesimal generators. In practice, these operators typically need to be approximated from data. Popular approximation methods are extended dynamic mode decomposition (EDMD) and generator extended mode decomposition (gEDMD). We propose a unified framework that leverages Monte Carlo sampling to approximate the operator of interest on a finite-dimensional space spanned by a set of basis functions. Our framework contains EDMD and gEDMD as special cases, but can also be used to approximate more general operators. Our key contributions are proofs of the convergence of the approximating operator and its spectrum under non-restrictive conditions. Moreover, we derive explicit convergence rates and account for the presence of noise in the observations. Whilst all these results are broadly applicable, they also refine previous analyses of EDMD and gEDMD. We verify the analytical results with the aid of several numerical experiments.
