Transformers as Implicit State Estimators: In-Context Learning in Dynamical Systems

Usman Akram; Haris Vikalo

Transformers as Implicit State Estimators: In-Context Learning in Dynamical Systems

Usman Akram, Haris Vikalo

TL;DR

This work demonstrates that frozen transformers, trained through in-context learning on synthetic dynamical trajectories, can perform latent-state estimation for both linear-Gaussian and nonlinear dynamical systems without test-time gradient updates. The authors construct Kalman-filter-like operations using transformer primitives via a RAW-like framework, and show that the transformer’s predictions converge toward Kalman-filter behavior as context length and model scale increase. In nonlinear settings, the Transformer attains accuracy comparable to EKF and PF, and in some cases surpasses them, illustrating robust, data-driven inference. The findings imply that transformer-based in-context learning can serve as a flexible, non-parametric approach to output prediction and latent-state estimation in dynamical systems, with robustness to missing model information and potential applicability to control tasks.

Abstract

Predicting the behavior of a dynamical system from noisy observations of its past outputs is a classical problem encountered across engineering and science. For linear systems with Gaussian inputs, the Kalman filter -- the best linear minimum mean-square error estimator of the state trajectory -- is optimal in the Bayesian sense. For nonlinear systems, Bayesian filtering is typically approached using suboptimal heuristics such as the Extended Kalman Filter (EKF), or numerical methods such as particle filtering (PF). In this work, we show that transformers, employed in an in-context learning (ICL) setting, can implicitly infer hidden states in order to predict the outputs of a wide family of dynamical systems, without test-time gradient updates or explicit knowledge of the system model. Specifically, when provided with a short context of past input-output pairs and, optionally, system parameters, a frozen transformer accurately predicts the current output. In linear-Gaussian regimes, its predictions closely match those of the Kalman filter; in nonlinear regimes, its performance approaches that of EKF and PF. Moreover, prediction accuracy degrades gracefully when key parameters, such as the state-transition matrix, are withheld from the context, demonstrating robustness and implicit parameter inference. These findings suggest that transformer in-context learning provides a flexible, non-parametric alternative for output prediction in dynamical systems, grounded in implicit latent-state estimation.

Transformers as Implicit State Estimators: In-Context Learning in Dynamical Systems

TL;DR

Abstract

Transformers as Implicit State Estimators: In-Context Learning in Dynamical Systems

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (12)