Topological Approach for Data Assimilation
Max M. Chumley, Firas A. Khasawneh
TL;DR
This paper introduces TADA, a noise-agnostic data assimilation framework that blends random feature map forecasting with topological data analysis. By differentiating through persistence diagrams and minimizing Wasserstein-based topological differences between forecast and observations, TADA updates the model without requiring measurement noise statistics. The method is validated on chaotic Lorenz 63 and extended to Lorenz 96, showing robustness to white, pink, and Brownian noise and scalability to higher dimensions, albeit with careful hyperparameter tuning. The work provides code for reproducibility and highlights a potentially impactful approach for topology-informed data assimilation in complex dynamical systems.
Abstract
Many dynamical systems are difficult or impossible to model using high fidelity physics based models. Consequently, researchers are relying more on data driven models to make predictions and forecasts. Based on limited training data, machine learning models often deviate from the true system states over time and need to be continually updated as new measurements are taken using data assimilation. Classical data assimilation algorithms typically require knowledge of the measurement noise statistics which may be unknown. In this paper, we introduce a new data assimilation algorithm with a foundation in topological data analysis. By leveraging the differentiability of functions of persistence, gradient descent optimization is used to minimize topological differences between measurements and forecast predictions by tuning data driven model coefficients without using noise information from the measurements. We describe the method and focus on its capabilities performance using the chaotic Lorenz 63 system as an example and we also show that the method works on a higher dimensional example with the Lorenz 96 system.
