Simultaneous Latent State Estimation and Latent Linear Dynamics Discovery from Image Observations

Nikita Kostin

Simultaneous Latent State Estimation and Latent Linear Dynamics Discovery from Image Observations

Nikita Kostin

TL;DR

This work addresses estimating latent state trajectories from high-dimensional image observations under near-linear latent dynamics. It surveys traditional filters and neural approaches, then introduces Normalizing Flows based Particle Filter (NFPF), which uses a conditional NF to model the observation likelihood $p(\mathbf{y}|\mathbf{x})$ and learns time-varying linear dynamics via neural networks. The framework jointly optimizes NF parameters, latent-state projections, and dynamic matrices, enabling full posterior filtering rather than only samples. While offering a principled density-based approach, the method faces computational and data limitations, highlighting a trade-off between SINDy-style latent-dynamics discovery and NF-based posterior estimation for image-based state estimation.

Abstract

The problem of state estimation has a long history with many successful algorithms that allow analytical derivation or approximation of posterior filtering distribution given the noisy observations. This report tries to conclude previous works to resolve the problem of latent state estimation given image-based observations and also suggests a new solution to this problem.

Simultaneous Latent State Estimation and Latent Linear Dynamics Discovery from Image Observations

TL;DR

and learns time-varying linear dynamics via neural networks. The framework jointly optimizes NF parameters, latent-state projections, and dynamic matrices, enabling full posterior filtering rather than only samples. While offering a principled density-based approach, the method faces computational and data limitations, highlighting a trade-off between SINDy-style latent-dynamics discovery and NF-based posterior estimation for image-based state estimation.

Abstract

Paper Structure (11 sections, 11 equations, 5 figures)

This paper contains 11 sections, 11 equations, 5 figures.

Introduction
Materials and Methods
Filtering algorithms
Kalman Filter
Particle Filter (PF)
Autoencoders (AE)
Normalizing Flows (NF)
Normalizing Flows based Particle Filter (NFPF)
Modelling observation likelihood using conditional NF
Experimental evaluation
Discussion and Conclusion

Figures (5)

Figure 1: Autoencoder architecture: here $\mathbf{x}(t)$ stands for high-dimensional observation, $\mathbf{z}(t)$ stands for latent state and $\mathbf{\hat{x}}(t)$ is reconstructed observation by the decoder $\psi$.
Figure 2: Proposed model architecture: $\mathbf{y}_t$ denotes image observation, $\hat{\mathbf{y}}_t$ denotes flow base distributed variables (Gaussian), $\mathbf{x}_t$ denotes latent state, $\mathbf{A}_t$ denotes dynamics matrix, $\mathbf{u}_t$ denotes control signal, $\mathbf{B}_t$ denotes control matrix, $\mu_0$ and $\Sigma_0$ are mean and covariance of the first latent state. Linear dynamics matrices are obtained using dynamics network $f_{\psi}$, $g_{\theta}$ denotes NF transformation, while $\mu_{\phi}$ stands for mean parametrization for $\hat{\mathbf{y}}$
Figure 3: RGB observation of CartPole used in the experiments.
Figure 4: Particle filter with 2 particles NF observation likelihood against true state evolution given the same control inputs for 4 dimensions.
Figure 5: PF with 2 particles NF observation likelihood against true state evolution given the same control inputs for 100 dimensions.

Simultaneous Latent State Estimation and Latent Linear Dynamics Discovery from Image Observations

TL;DR

Abstract

Simultaneous Latent State Estimation and Latent Linear Dynamics Discovery from Image Observations

Authors

TL;DR

Abstract

Table of Contents

Figures (5)