Robust Data-Driven Kalman Filtering for Unknown Linear Systems using Maximum Likelihood Optimization
Peihu Duan, Tao Liu, Yu Xing, Karl Henrik Johansson
TL;DR
This work addresses state estimation for unknown linear systems with both process and measurement noise by exploiting a pre-collected high-frequency input-output trajectory and a lower-frequency state trajectory. It introduces the Robust Data-Driven Kalman Filter (RDKF), built from a maximum-likelihood formulation and a recursive, tractable modification that yields state estimates without knowing $A$, $B$, or $C$; it also derives a data-driven sample-complexity bound that shows the estimation gap to the model-based Kalman filter vanishes as the amount of pre-collected data grows. The analysis covers data informativity, matrix-identification error bounds, and conditions under which the learned filter remains observable and stable, with extensions to cases where only input-output data are available, yielding a balanced realization suitable for LQG control. Numerical experiments on a chemical reactor (CSTR) demonstrate improved filtering performance with more pre-collected data and highlight robustness to data noise, validating the theoretical guarantees and practical applicability in scenarios where state sampling is slower than input-output measurements.
Abstract
This paper investigates the state estimation problem for unknown linear systems subject to both process and measurement noise. Based on a prior input-output trajectory sampled at a higher frequency and a prior state trajectory sampled at a lower frequency, we propose a novel robust data-driven Kalman filter (RDKF) that integrates model identification with state estimation for the unknown system. Specifically, the state estimation problem is formulated as a non-convex maximum likelihood optimization problem. Then, we slightly modify the optimization problem to get a problem solvable with a recursive algorithm. Based on the optimal solution to this new problem, the RDKF is designed, which can estimate the state of a given but unknown state-space model. The performance gap between the RDKF and the optimal Kalman filter based on known system matrices is quantified through a sample complexity bound. In particular, when the number of the pre-collected states tends to infinity, this gap converges to zero. Finally, the effectiveness of the theoretical results is illustrated by numerical simulations.
