Filtering Jump Markov Systems with Partially Known Dynamics: A Model-Based Deep Learning Approach
George Stamatelis, George C. Alexandropoulos
TL;DR
The paper addresses real-time state estimation in Jump Markov Systems with unknown noise statistics and mode-transition dynamics. It introduces JMFNet, a model-based deep learning framework comprising a Mode Prediction NN and a Mode-Augmented KalmanNet that jointly infer discrete modes and continuous states via alternating least squares training. Empirical results across linear and nonlinear tasks—including target tracking, pendulum dynamics, Lorenz attractors, and traffic data—show JMFNet outperforms classical model-based filters, particle filters, and non-switching neural baselines, with pronounced gains in nonstationary or high-noise regimes. The approach is demonstrated to be robust to initialization and model mismatches and scalable to long-horizon sequences, suggesting significant practical impact for high-stakes JMS applications.
Abstract
This paper presents the Jump Markov Filtering Network (JMFNet), a novel model-based deep learning framework for real-time state-state estimation in jump Markov systems with unknown noise statistics and mode transition dynamics. A hybrid architecture comprising two Recurrent Neural Networks (RNNs) is proposed: one for mode prediction and another for filtering that is based on a mode-augmented version of the recently presented KalmanNet architecture. The proposed RNNs are trained jointly using an alternating least squares strategy that enables mutual adaptation without supervision of the latent modes. Extensive numerical experiments on linear and nonlinear systems, including target tracking, pendulum angle tracking, Lorenz attractor dynamics, and a real-life dataset demonstrate that the proposed JMFNet framework outperforms classical model-based filters (e.g., interacting multiple models and particle filters) as well as model-free deep learning baselines, particularly in non-stationary and high-noise regimes. It is also showcased that JMFNet achieves a small yet meaningful improvement over the KalmanNet framework, which becomes much more pronounced in complicated systems or long trajectories. Finally, the method's performance is empirically validated to be consistent and reliable, exhibiting low sensitivity to initial conditions, hyperparameter selection, as well as to incorrect model knowledge
