Identifying latent state transition in non-linear dynamical systems
Çağlar Hızlı, Çağatay Yıldız, Matthias Bethge, ST John, Pekka Marttinen
TL;DR
The paper tackles identifying latent states and the nonlinear transition function of dynamical systems from high-dimensional observations by introducing an identifiable state-space framework with augmented dynamics and an auxiliary variable to induce nonstationarity or autocorrelation. It proves identifiability for both latent states (Theorem 1) and the transition function (Theorem 2) under structured assumptions, and implements a variational autoencoder-based algorithm that jointly recovers latents, dynamics, and process noise. Through synthetic and modified Cartpole experiments, the method achieves superior latent recovery, calibrated uncertainty, and longer-horizon prediction, while demonstrating data-efficient adaptation to new environments. The work advances interpretable, transferable dynamical modeling with potential applications in model-based control and policy learning, while acknowledging limitations tied to the identifiability assumptions and simulated testbeds.
Abstract
This work aims to improve generalization and interpretability of dynamical systems by recovering the underlying lower-dimensional latent states and their time evolutions. Previous work on disentangled representation learning within the realm of dynamical systems focused on the latent states, possibly with linear transition approximations. As such, they cannot identify nonlinear transition dynamics, and hence fail to reliably predict complex future behavior. Inspired by the advances in nonlinear ICA, we propose a state-space modeling framework in which we can identify not just the latent states but also the unknown transition function that maps the past states to the present. We introduce a practical algorithm based on variational auto-encoders and empirically demonstrate in realistic synthetic settings that we can (i) recover latent state dynamics with high accuracy, (ii) correspondingly achieve high future prediction accuracy, and (iii) adapt fast to new environments.
