An updated look on the convergence and consistency of data-driven dynamical models

Kristian Løvland; Bjarne Grimstad; Lars Struen Imsland

An updated look on the convergence and consistency of data-driven dynamical models

Kristian Løvland, Bjarne Grimstad, Lars Struen Imsland

TL;DR

The paper addresses convergence and consistency for nonlinear probabilistic sequence models used to learn controlled dynamical systems under maximum likelihood estimation. It proves that, under stability-like forgetting and regularity conditions, the log-likelihood objective $L_T(\theta)$ converges uniformly to a limit $\overline{L}(\theta)$ and the ML estimator satisfies $\hat{\theta}_T \to \arg\max_{\theta} \overline{L}(\theta)$. A consistency result for $s$-th order Markov properties shows ML converges to a KL-divergence minimizer, and with full support yields recovery of the true conditional $p(Y|\Phi)$. Two illustrative examples—linear system identification and finite-state Markov chains—demonstrate the theory’s applicability to both continuous and discrete observation spaces. Overall, the work extends classical PE identification to probabilistic sequence models with long memory and general observation spaces, informing design and analysis of data-driven dynamical models in control and RL contexts.

Abstract

Deep sequence models are receiving significant interest in current machine learning research. By representing probability distributions that are fit to data using maximum likelihood estimation, such models can model data on general observation spaces (both continuous and discrete-valued). Furthermore, they can be applied to a wide range of modelling problems, including modelling of dynamical systems which are subject to control. The problem of learning data-driven models of systems subject to control is well studied in the field of system identification. In particular, there exist theoretical convergence and consistency results which can be used to analyze model behaviour and guide model development. However, these results typically concern models which provide point predictions of continuous-valued variables. Motivated by this, we derive convergence and consistency results for a class of nonlinear probabilistic models defined on a general observation space. The results rely on stability and regularity assumptions, and can be used to derive consistency conditions and bias expressions for nonlinear probabilistic models of systems under control. We illustrate the results on examples from linear system identification and Markov chains on finite state spaces.

An updated look on the convergence and consistency of data-driven dynamical models

TL;DR

converges uniformly to a limit

and the ML estimator satisfies

. A consistency result for

-th order Markov properties shows ML converges to a KL-divergence minimizer, and with full support yields recovery of the true conditional

. Two illustrative examples—linear system identification and finite-state Markov chains—demonstrate the theory’s applicability to both continuous and discrete observation spaces. Overall, the work extends classical PE identification to probabilistic sequence models with long memory and general observation spaces, informing design and analysis of data-driven dynamical models in control and RL contexts.

Abstract

Paper Structure (26 sections, 7 theorems, 47 equations)

This paper contains 26 sections, 7 theorems, 47 equations.

Introduction
Structure of paper
Notation
Related work
Consistency of Maximum Likelihood
Prediction Error Identification
Hidden Markov Models
Problem statement
Dataset
Model class
Objective function
Goal
Assumptions
Stability
Regularity
...and 11 more sections

Key Result

Proposition 1

Assume that the closed-loop system $\{U_t, Y_t\}$ is $r$-mean exponentially stable with $r=4$ (def. 4.8 in abdalmoaty2019identification). Furthermore, assume that all models $q_{\theta} \in \mathcal{M}$ is on the form $q_{\theta}(Y_t \,|\, U_{1:t-1}, Y_{1:t-1}) = \mathcal{N}(Y_t; f_{\theta}(U_{1:t-1

Theorems & Definitions (14)

Proposition 1
proof
Theorem 1
proof
Corollary 1
proof
Lemma 1
proof
Corollary 2
proof
...and 4 more

An updated look on the convergence and consistency of data-driven dynamical models

TL;DR

Abstract

An updated look on the convergence and consistency of data-driven dynamical models

Authors

TL;DR

Abstract

Table of Contents

Key Result

Theorems & Definitions (14)