Simultaneous model discovery and state estimation under high data corruption
Teddy Meissner, Karl Glasner
TL;DR
The paper addresses learning ODEs from highly corrupted data by jointly inferring state trajectories and sparse models using a hybrid likelihood-based regression framework. It introduces a sparse, library-based regression with a smooth $L_0$-like penalty and BIC-based model selection, solved with a second-order Levenberg–Marquardt method that exploits Hessian sparsity via graph coloring and automatic differentiation. Extensive experiments on the Van der Pol, Lorenz, Lorenz-96, and Colpitts systems demonstrate robustness to noise and data gaps and competitive performance against WSINDy. The approach is general to ODEs and potentially extensible to DAEs and PDEs, offering interpretable, data-driven models with limited tuning requirements.
Abstract
This paper proposes a sparse regression strategy for discovery of ordinary differential equations from incomplete and noisy data. Inference is performed over both equation parameters and state variables using a statistically motivated likelihood function. Sparsity is enforced by a selection algorithm which iteratively removes terms and compares models using statistical information criteria. Large scale optimization is performed using a second-order variant of the Levenberg-Marquardt method, where the gradient and Hessian are computed via automatic differentiation. The proposed method is illustrated and tested on several systems with varying levels of noisy and incomplete data. Comparisons are made to a state-of-the-art algorithm for system identification, demonstrating competitiveness of the proposed approach.
