Learning to Stabilize Unknown LTI Systems on a Single Trajectory under Stochastic Noise
Ziyi Zhang, Yorie Nakahira, Guannan Qu
TL;DR
This work tackles stabilizing an unknown noisy LTI system from a single trajectory, where instability concentrates in a low-dimensional subspace. It introduces LTS0-N, a model-based method that first identifies the unstable subspace via SVD, then estimates the unstable dynamics and the effect of control on that subspace, and finally designs a τ-hop stabilizing controller. Theoretical results show stabilization with a state-norm upper bound of $2^{O(k \log k + \log(n-k) + m - \log\text{gap})}$, avoiding the traditional exponential blow-up in the full state dimension $n$, especially when $m=O(k)$. The approach is robust to stochastic noise and under-actuated settings, and simulations demonstrate faster stabilization than existing benchmarks on a single trajectory, highlighting practical relevance for high-dimensional, uncertain systems.
Abstract
We study the problem of learning to stabilize unknown noisy Linear Time-Invariant (LTI) systems on a single trajectory. It is well known in the literature that the learn-to-stabilize problem suffers from exponential blow-up in which the state norm blows up in the order of $Θ(2^n)$ where $n$ is the state space dimension. This blow-up is due to the open-loop instability when exploring the $n$-dimensional state space. To address this issue, we develop a novel algorithm that decouples the unstable subspace of the LTI system from the stable subspace, based on which the algorithm only explores and stabilizes the unstable subspace, the dimension of which can be much smaller than $n$. With a new singular-value-decomposition(SVD)-based analytical framework, we prove that the system is stabilized before the state norm reaches $2^{O(k \log n)}$, where $k$ is the dimension of the unstable subspace. Critically, this bound avoids exponential blow-up in state dimension in the order of $Θ(2^n)$ as in the previous works, and to the best of our knowledge, this is the first paper to avoid exponential blow-up in dimension for stabilizing LTI systems with noise.
