Efficient identification of linear, parameter-varying, and nonlinear systems with noise models
Alberto Bemporad, Roland Tóth
TL;DR
Addresses the challenge of identifying a broad class of dynamical systems from data by introducing a unified framework that separates deterministic dynamics $G_ ext{o}$ and the noise model $H_ ext{o}$, and parameterizes nonlinearities with artificial neural networks. The approach yields a prediction-error minimization (PEM) estimator applicable to $LTI$, $LPV$ (external and self-scheduled), and $NL$ models, with efficient training via $L$-BFGS-$B$ and automatic differentiation in $JAX$. The authors prove convergence and consistency under mild assumptions and demonstrate that training times drop from hours to seconds while achieving or surpassing state-of-the-art accuracy on benchmarks including unbalanced disk and CMG systems. This work offers a scalable, rigorous tool for data-driven system identification across a spectrum of model classes and noise structures, enabling reliable controller design and adaptation.
Abstract
We present a general system identification procedure capable of estimating of a broad spectrum of state-space dynamical models, including linear time-invariant (LTI), linear parameter-varying} (LPV), and nonlinear (NL) dynamics, along with rather general classes of noise models. Similar to the LTI case, we show that for this general class of model structures, including the NL case, the model dynamics can be separated into a deterministic process and a stochastic noise part, allowing to seamlessly tune the complexity of the combined model both in terms of nonlinearity and noise modeling. We parameterize the involved nonlinear functional relations by means of artificial neural-networks (ANNs), although alternative parametric nonlinear mappings can also be used. To estimate the resulting model structures, we optimize a prediction-error-based criterion using an efficient combination of a constrained quasi-Newton approach and automatic differentiation, achieving training times in the order of seconds compared to existing state-of-the-art ANN methods which may require hours for models of similar complexity. We formally establish the consistency guarantees for the proposed approach and demonstrate its superior estimation accuracy and computational efficiency on several benchmark LTI, LPV, and NL system identification problems.
