EKF-SINDy: Empowering the extended Kalman filter with sparse identification of nonlinear dynamics
Luca Rosafalco, Paolo Conti, Andrea Manzoni, Stefano Mariani, Attilio Frangi
TL;DR
This work introduces EKF-SINDy, a data-driven framework that integrates an Extended Kalman Filter with Sparse Identification of Nonlinear Dynamics to identify nonlinear system dynamics and parameters from noisy, potentially partial observations. By training a SINDy model offline and evolving it in the EKF prediction step, the method yields a computationally efficient surrogate that also provides Jacobians needed by the EKF, enabling joint state and parameter estimation for autonomous and non-autonomous systems. The approach is demonstrated on a seismic shear-building model and a partially observed nonlinear resonator, with time-delay embedding enabling recovery of hidden states and robust estimation of stiffness and coupling parameters; results show accurate tracking, uncertainty quantification, and even effective performance when operating outside the SINDy training range. Overall, EKF-SINDy advances real-time digital-twin development by combining physics-informed sparsity with data assimilation, offering a scalable, interpretable alternative to fully black-box neural models for nonlinear dynamics.
Abstract
Measured data from a dynamical system can be assimilated into a predictive model by means of Kalman filters. Nonlinear extensions of the Kalman filter, such as the Extended Kalman Filter (EKF), are required to enable the joint estimation of (possibly nonlinear) system dynamics and of input parameters. To construct the evolution model used in the prediction phase of the EKF, we propose to rely on the Sparse Identification of Nonlinear Dynamics (SINDy). SINDy enables to identify the evolution model directly from preliminary acquired data, thus avoiding possible bias due to wrong assumptions and incorrect modelling of the system dynamics. Moreover, the numerical integration of a SINDy model leads to great computational savings compared to alternate strategies based on, e.g., finite elements. Last, SINDy allows an immediate definition of the Jacobian matrices required by the EKF to identify system dynamics and properties, a derivation that is usually extremely involved with physical models. As a result, combining the EKF with SINDy provides a data-driven computationally efficient, easy-to-apply approach for the identification of nonlinear systems, capable of robust operation even outside the range of training of SINDy. To demonstrate the potential of the approach, we address the identification of a linear non-autonomous system consisting of a shear building model excited by real seismograms, and the identification of a partially observed nonlinear system. The challenge arising from the use of SINDy when the system state is not entirely accessible has been relieved by means of time-delay embedding. The great accuracy and the small uncertainty associated with the state identification, where the state has been augmented to include system properties, underscores the great potential of the proposed strategy, paving the way for the setting of predictive digital twins in different fields.
