Learning Dynamical Systems by Leveraging Data from Similar Systems

Lei Xin; Lintao Ye; George Chiu; Shreyas Sundaram

TL;DR

This work studies finite-sample learning of a linear time-invariant system using data from a similar auxiliary system. It introduces a weighted least-squares framework that blends true- and auxiliary-system rollouts with a tunable weight $q$ and optional regularization $\lambda$, and it derives data-independent and data-dependent error bounds that decompose the estimation error into noise-driven and model-difference components. The results show that auxiliary data can reduce intrinsic noise error at the cost of bias from model mismatch, and they provide guidelines and computable bounds to select $q$ and $\lambda$. Through numerical experiments, the authors illustrate how trajectory lengths and $q$ affect performance across scenarios and demonstrate how the data-dependent bound can guide practical weight selection. Overall, the paper offers a principled transfer-learning-like approach for system identification with provable guarantees and practical guidance for leveraging related systems.

Abstract

We consider the problem of learning the dynamics of a linear system when one has access to data generated by an auxiliary system that shares similar (but not identical) dynamics, in addition to data from the true system. We use a weighted least squares approach, and provide finite sample error bounds of the learned model as a function of the number of samples and various system parameters from the two systems as well as the weight assigned to the auxiliary data. We show that the auxiliary data can help to reduce the intrinsic system identification error due to noise, at the price of adding a portion of error that is due to the differences between the two system models. We further provide a data-dependent bound that is computable when some prior knowledge about the systems, such as upper bounds on noise levels and model difference, is available. This bound can also be used to determine the weight that should be assigned to the auxiliary data during the model training stage.
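As a rough illustration of the weighted least-squares idea described in the abstract, the sketch below pools rollouts from a true system and a slightly perturbed auxiliary system, weights the auxiliary data by $q$, and optionally adds a ridge term $\lambda$. The system matrices, noise levels, rollout counts, and the function names `simulate_rollouts` and `weighted_least_squares` are illustrative assumptions, not values or code from the paper. The closed form used is the standard solution of a ridge-regularized weighted least-squares problem, not a transcription of the authors' algorithm.

```python
import numpy as np


def simulate_rollouts(A, B, n_rollouts, horizon, noise_std, rng):
    """Collect regression pairs from x_{t+1} = A x_t + B u_t + w_t.

    Returns X_next with shape (n, N) and Z = [x; u] with shape (n + m, N),
    where N = n_rollouts * horizon. Each rollout starts at x_0 = 0 (assumption).
    """
    n, m = B.shape
    x_next_cols, z_cols = [], []
    for _ in range(n_rollouts):
        x = np.zeros(n)
        for _ in range(horizon):
            u = rng.standard_normal(m)              # random excitation input
            w = noise_std * rng.standard_normal(n)  # process noise
            x_new = A @ x + B @ u + w
            x_next_cols.append(x_new)
            z_cols.append(np.concatenate([x, u]))
            x = x_new
    return np.array(x_next_cols).T, np.array(z_cols).T


def weighted_least_squares(X_true, Z_true, X_aux, Z_aux, q, lam=0.0):
    """Minimize ||X_true - Th Z_true||_F^2 + q ||X_aux - Th Z_aux||_F^2 + lam ||Th||_F^2."""
    d = Z_true.shape[0]
    cross = X_true @ Z_true.T + q * (X_aux @ Z_aux.T)
    gram = Z_true @ Z_true.T + q * (Z_aux @ Z_aux.T) + lam * np.eye(d)
    # Theta @ gram = cross, and gram is symmetric, so solve gram @ Theta.T = cross.T.
    return np.linalg.solve(gram, cross.T).T


rng = np.random.default_rng(0)

# Hypothetical true system and a nearby auxiliary system (illustrative values).
A = np.array([[0.6, 0.2], [0.0, 0.5]])
B = np.array([[1.0], [0.5]])
A_aux = A + 0.05 * np.array([[1.0, 0.0], [0.0, -1.0]])
B_aux = B.copy()

# Few short rollouts from the true system, many longer ones from the auxiliary system.
X_r, Z_r = simulate_rollouts(A, B, n_rollouts=5, horizon=4, noise_std=1.0, rng=rng)
X_p, Z_p = simulate_rollouts(A_aux, B_aux, n_rollouts=50, horizon=12, noise_std=1.0, rng=rng)

theta_star = np.hstack([A, B])  # ground truth [A B] of the true system
for q in (0.0, 0.1, 0.5, 1.0):
    theta_hat = weighted_least_squares(X_r, Z_r, X_p, Z_p, q=q)
    err = np.linalg.norm(theta_hat - theta_star, 2)
    print(f"q = {q:.1f}  spectral-norm error = {err:.3f}")
```

Setting `q = 0` discards the auxiliary data and reduces to ordinary least squares on the true-system rollouts, while larger `q` trades noise-driven error for bias from the model difference. Which value works best depends on the noise levels, the model difference, and the amount of data from each system.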

Paper Structure (19 sections, 18 theorems, 93 equations, 9 figures)

Introduction
Mathematical Notation and Terminology
Problem formulation and algorithm
Finite Sample Guarantees of the System Identification Error
Data-independent Bounds
Data-dependent Bound
Numerical Experiments to Illustrate Various Scenarios for System Identification from Auxiliary Data
Predetermined $q$
Scenario 1: Both $T_{r}$ and $T_{p}$ are increasing
Scenario 2: $T_{p}$ is fixed but $T_{r}$ is increasing
Scenario 3: $T_{r}$ is fixed but $T_{p}$ is increasing
Selecting $q$ based on the data-dependent bound theorem
Conclusion
Intermediate Results
Proof of the data-independent bound theorem
...and 4 more sections

Key Result

Theorem 1

Fix $q\geq 0$, $\delta \in (0,\frac{2}{e})$, and let the paper's stated assumption hold. Denote $\bar{\zeta}=\frac{\bar{\sigma}_{min}}{c_{1}\bar{\sigma}_{*}}$ and $\hat{\zeta}=\frac{\hat{\sigma}_{min}}{c_{1}\hat{\sigma}_{*}}$. Suppose that $N_{r}T_{r}\geq \max\{33,8c_{1}^2\bar{\sigma}_{*}^2(\log\frac{2}{\delta}\ldots$

Figures (9)

Figure 1: Scenario 1: Both $T_r$ and $T_p$ increase over time ($T_p = 3T_r$)
Figure 2: Scenario 2: $T_p$ is fixed, and $T_r$ increases over time
Figure 3: Scenario 3: $T_r$ is fixed, and $T_p$ increases over time
Figure 4: Baseline case 1: $\mathbf{\Delta}=0.1, \sigma_{\bar{w}}=\sigma_{\hat{w}}=1, N_{r}=20$. An intermediate value of $q$ is optimal
Figure 5: Baseline case 2: $\mathbf{\Delta}=0.11, \sigma_{\bar{w}}=1, \sigma_{\hat{w}}=1.1, N_{r}=19$. An intermediate value of $q$ is optimal
...and 4 more figures

Theorems & Definitions (38)

Remark 1
Definition 1
Theorem 1
Remark 2
Corollary 1
Remark 3
Proposition 1
Remark 4
Theorem 2
Remark 5
...and 28 more
