When do World Models Successfully Learn Dynamical Systems?
Edmund Ross, Claudia Drygala, Leonhard Schwarz, Samir Kaiser, Francesca di Mare, Tobias Breiten, Hanno Gottschalk
TL;DR
The paper tackles learning dynamical systems governed by PDEs from tokenized observations using World Models. It formalizes a system-theoretic framework with a tokenization $h:\mathcal{X}\to\mathbb{R}^m$, latent state $y=h(x)$, reconstruction $G:\mathcal{Y}\to\mathcal{X}$, and autoregressive update $g:\mathcal{Y}^k\to\mathcal{Y}$, and demonstrates that observability guarantees the existence of $G$ and $g$ along with a latent propagator $S_\Delta$ in the latent space. The authors prove PAC-like guarantees for learning $g$ and $G$ and validate the approach on heat, wave, KS, and cylinder-flow data, showing low prediction error and strong temporal coherence while achieving substantial speedups over LES. Compared to neural-operator baselines (e.g., FNO, DeepONet), the world-model framework offers improved long-horizon stability and efficiency, particularly for in-distribution and out-of-distribution test data. The work provides a principled link between observability and learnability for world models and demonstrates practical utility in turbulent-flow synthesis and CFD surrogacy.
Abstract
In this work, we explore the use of compact latent representations with learned time dynamics ('World Models') to simulate physical systems. Drawing on concepts from control theory, we propose a theoretical framework that explains why projecting time slices into a low-dimensional space and then concatenating to form a history ('Tokenization') is so effective at learning physics datasets, and characterise when exactly the underlying dynamics admit a reconstruction mapping from the history of previous tokenized frames to the next. To validate these claims, we develop a sequence of models with increasing complexity, starting with least-squares regression and progressing through simple linear layers, shallow adversarial learners, and ultimately full-scale generative adversarial networks (GANs). We evaluate these models on a variety of datasets, including modified forms of the heat and wave equations, the chaotic regime 2D Kuramoto-Sivashinsky equation, and a challenging computational fluid dynamics (CFD) dataset of a 2D Kármán vortex street around a fixed cylinder, where our model is successfully able to recreate the flow.
