Automating the Discovery of Partial Differential Equations in Dynamical Systems
Weizhen Li, Rui Carvalho
TL;DR
ARGOS-RAL addresses the problem of discovering PDEs from data by extending ARGOS with a recurrent adaptive lasso to identify governing equations directly from spatiotemporal data. It automates numerical differentiation via Savitzky-Golay filtering and Gaussian blur, builds a rich candidate library, and solves a single sparse regression across all time points with iterative reweighting and AIC-based model selection. Across diverse canonical PDEs, ARGOS-RAL demonstrates robustness to noise and nonuniform sampling and often outperforms STRidge, though some equations require more data and the method remains library-dependent with limited uncertainty quantification. The approach promises automated, scalable PDE discovery across physics, biology, and engineering by integrating statistical regression, machine learning, and dynamical-systems theory.
Abstract
Identifying partial differential equations (PDEs) from data is crucial for understanding the governing mechanisms of natural phenomena, yet it remains a challenging task. We present an extension to the ARGOS framework, ARGOS-RAL, which leverages sparse regression with the recurrent adaptive lasso to identify PDEs from limited prior knowledge automatically. Our method automates calculating partial derivatives, constructing a candidate library, and estimating a sparse model. We rigorously evaluate the performance of ARGOS-RAL in identifying canonical PDEs under various noise levels and sample sizes, demonstrating its robustness in handling noisy and non-uniformly distributed data. We also test the algorithm's performance on datasets consisting solely of random noise to simulate scenarios with severely compromised data quality. Our results show that ARGOS-RAL effectively and reliably identifies the underlying PDEs from data, outperforming the sequential threshold ridge regression method in most cases. We highlight the potential of combining statistical methods, machine learning, and dynamical systems theory to automatically discover governing equations from collected data, streamlining the scientific modeling process.
