State-space models are accurate and efficient neural operators for dynamical systems
Zheyuan Hu, Nazanin Ahmadi Daryakenari, Qianli Shen, Kenji Kawaguchi, George Em Karniadakis
TL;DR
The paper addresses the challenge of accurately and efficiently learning operators for dynamical systems, especially under long-time integration and extrapolation, by introducing Mamba, a state-space model that dynamically captures long-range dependencies with linear-time inference. Through extensive benchmarks spanning 1D ODEs, discontinuous dynamics, long-time integration, chaotic systems, and a real-world PK-PD application, Mamba consistently matches or outperforms strong baselines (RNNs, Transformers, neural operators) while achieving lower computational costs. The results demonstrate Mamba's robustness in interpolation and extrapolation, its scalability to long sequences, and its applicability to data-scarce real-world problems when combined with physics information. Overall, Mamba positions state-space modeling as a powerful, efficient framework for scientific machine learning in dynamical-systems modeling, with clear potential for PDE extensions and parameter-inference tasks in pharmacology and beyond.
Abstract
Physics-informed machine learning (PIML) has emerged as a promising alternative to classical methods for predicting dynamical systems, offering faster and more generalizable solutions. However, existing models, including recurrent neural networks (RNNs), transformers, and neural operators, face challenges such as long-time integration, long-range dependencies, chaotic dynamics, and extrapolation, to name a few. To this end, this paper introduces state-space models implemented in Mamba for accurate and efficient dynamical system operator learning. Mamba addresses the limitations of existing architectures by dynamically capturing long-range dependencies and enhancing computational efficiency through reparameterization techniques. To extensively test Mamba and compare against another 11 baselines, we introduce several strict extrapolation testbeds that go beyond the standard interpolation benchmarks. We demonstrate Mamba's superior performance in both interpolation and challenging extrapolation tasks. Mamba consistently ranks among the top models while maintaining the lowest computational cost and exceptional extrapolation capabilities. Moreover, we demonstrate the good performance of Mamba for a real-world application in quantitative systems pharmacology for assessing the efficacy of drugs in tumor growth under limited data scenarios. Taken together, our findings highlight Mamba's potential as a powerful tool for advancing scientific machine learning in dynamical systems modeling. (The code will be available at https://github.com/zheyuanhu01/State_Space_Model_Neural_Operator upon acceptance.)
