On Foundation Models for Dynamical Systems from Purely Synthetic Data
Martin Ziegler, Andres Felipe Posada-Moreno, Friedrich Solowjow, Sebastian Trimpe
TL;DR
The paper investigates whether foundation-models can perform low-level state prediction for dynamical systems by pretraining a decoder-only transformer on purely synthetic RKHS-generated trajectories. The proposed TimesFM-style architecture generalizes to unseen systems in both simulation and hardware (cart-pole and Furuta pendulum) and can be effectively fine-tuned with limited data to improve performance. Key contributions include a scalable RKHS-based data generation pipeline, an autoregressive transformer tailored for time-series dynamics, and a thorough evaluation of generalization, data efficiency, and robustness. The results indicate that synthetic-data pretraining can yield a robust, data-efficient foundation for real-time control, with practical implications for deploying generalist models across diverse dynamical domains.
Abstract
Foundation models have demonstrated remarkable generalization, data efficiency, and robustness properties across various domains. In this paper, we explore the feasibility of foundation models for applications in the control domain. The success of these models is enabled by large-scale pretaining on Internet-scale datasets. These are available in fields like natural language processing and computer vision, but do not exist for dynamical systems. We address this challenge by pretraining a transformer-based foundation model exclusively on synthetic data and propose to sample dynamics functions from a reproducing kernel Hilbert space. Our pretrained model generalizes for prediction tasks across different dynamical systems, which we validate in simulation and hardware experiments, including cart-pole and Furuta pendulum setups. Additionally, the model can be fine-tuned effectively to new systems to increase performance even further. Our results demonstrate the feasibility of foundation models for dynamical systems that outperform specialist models in terms of generalization, data efficiency, and robustness.
