State estimator design using Jordan based long short-term memory networks
Avneet Kaur, Kirsten Morris
TL;DR
The paper addresses nonlinear state estimation by combining model-based insight with data-driven learning, introducing Jordan-based long short-term memory networks (JLSTM) as a faster-training alternative to ELSTM for estimating system states from noisy observations. It formalizes both ERN/JRN and ELSTM/JLSTM architectures, proves universal approximation capabilities for JRNs in state estimation, and provides implementation details and training protocols. Through three numerical experiments (one linear, two nonlinear), JLSTM and ELSTM outperform traditional EKF in nonlinear settings and show NMSE comparable to KF in linear cases, with JLSTM achieving similar accuracy to ELSTM at significantly reduced training times. The results suggest JLSTM as a practical, scalable approach for nonlinear state estimation, with potential stability advantages to explore in future work.
Abstract
State estimation of a dynamical system refers to estimating the state of a system given an imperfect model, noisy measurements and some or no information about the initial state. While Kalman filtering is optimal for estimation of linear systems with Gaussian noises, calculation of optimal estimators for nonlinear systems is challenging. We focus on establishing a pathway to optimal estimation of high-order systems by using recurrent connections motivated by Jordan recurrent neural networks(JRNs). The results are compared to the corresponding Elman structure based long short-term memory network(ELSTM) and the KF for linear and EKF for nonlinear systems. The results suggest that for nonlinear systems, the use of long short-term memory networks can improve estimation error and also computation time. Also, the Jordan based long short-term memory networks(JLSTMs) require less training to achieve performance similar to ELSTMs.
