Simultaneous Latent State Estimation and Latent Linear Dynamics Discovery from Image Observations
Nikita Kostin
TL;DR
This work addresses estimating latent state trajectories from high-dimensional image observations under near-linear latent dynamics. It surveys traditional filters and neural approaches, then introduces Normalizing Flows based Particle Filter (NFPF), which uses a conditional NF to model the observation likelihood $p(\mathbf{y}|\mathbf{x})$ and learns time-varying linear dynamics via neural networks. The framework jointly optimizes NF parameters, latent-state projections, and dynamic matrices, enabling full posterior filtering rather than only samples. While offering a principled density-based approach, the method faces computational and data limitations, highlighting a trade-off between SINDy-style latent-dynamics discovery and NF-based posterior estimation for image-based state estimation.
Abstract
The problem of state estimation has a long history with many successful algorithms that allow analytical derivation or approximation of posterior filtering distribution given the noisy observations. This report tries to conclude previous works to resolve the problem of latent state estimation given image-based observations and also suggests a new solution to this problem.
