Data-assimilated model-informed reinforcement learning

Defne E. Ozan; Andrea Nóvoa; Georgios Rigas; Luca Magri

Data-assimilated model-informed reinforcement learning

Defne E. Ozan, Andrea Nóvoa, Georgios Rigas, Luca Magri

TL;DR

This work tackles controlling spatio-temporally chaotic systems under partial and noisy observability by proposing DA-MIRL, a framework that couples a predictive environment model, ensemble Kalman filter state estimation, and an off-policy actor-critic RL agent. By integrating a physics-based (truncated Fourier) or a data-driven (control-aware ESN) model with real-time data assimilation, the approach converts a POMDP into a practically tractable MDP for learning. The KS equation serves as a challenging testbed, where DA-MIRL achieves robust stabilization with significantly fewer sensors than model-free RL and maintains performance across varying chaotic regimes. The results highlight the practical impact of modular observer-controller design, enabling real-time control of complex chaotic dynamics in scenarios with limited sensing and noisy data. Overall, the framework broadens the applicability of RL-based control to partially observable, high-dimensional chaotic systems, with scalable, open-source implementations.

Abstract

The control of spatio-temporally chaos is challenging because of high dimensionality and unpredictability. Model-free reinforcement learning (RL) discovers optimal control policies by interacting with the system, typically requiring observations of the full physical state. In practice, sensors often provide only partial and noisy measurements (observations) of the system. The objective of this paper is to develop a framework that enables the control of chaotic systems with partial and noisy observability. The proposed method, data-assimilated model-informed reinforcement learning (DA-MIRL), integrates (i) low-order models to approximate high-dimensional dynamics; (ii) sequential data assimilation to correct the model prediction when observations become available; and (iii) an off-policy actor-critic RL algorithm to adaptively learn an optimal control strategy based on the corrected state estimates. We test DA-MIRL on the spatiotemporally chaotic solutions of the Kuramoto-Sivashinsky equation. We estimate the full state of the environment with (i) a physics-based model, here, a coarse-grained model; and (ii) a data-driven model, here, the control-aware echo state network, which is proposed in this paper. We show that DA-MIRL successfully estimates and suppresses the chaotic dynamics of the environment in real time from partial observations and approximate models. This work opens opportunities for the control of partially observable chaotic systems.

Data-assimilated model-informed reinforcement learning

TL;DR

Abstract

Data-assimilated model-informed reinforcement learning

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (15)