Stabilizing Linear Systems under Partial Observability: Sample Complexity and Fundamental Limits
Ziyi Zhang, Yorie Nakahira, Guannan Qu
TL;DR
The paper addresses stabilizing an unknown partially observable LTI system by isolating an unstable transfer component and learning it with a data-efficient lifted-Hankel SVD procedure. It combines a rank-$k$ Hankel factorization with robust $H_\infty$-based controller design, leveraging a small-gain condition on the stable subspace to guarantee full-system stability. The main result shows the data requirements scale with the number of unstable modes $k$ (as $O(k^2)$) and are independent of the full state dimension $n$, under standard controllability/observability and bound assumptions, with a steady treatment of non-diagonalizable unstable blocks. Simulations validate the approach, showing significant improvements in data efficiency for high-dimensional systems with many stable modes.
Abstract
We study the problem of stabilizing an unknown partially observable linear time-invariant (LTI) system. For fully observable systems, leveraging an unstable/stable subspace decomposition approach, state-of-art sample complexity is independent from system dimension $n$ and only scales with respect to the dimension of the unstable subspace. However, it remains open whether such sample complexity can be achieved for partially observable systems because such systems do not admit a uniquely identifiable unstable subspace. In this paper, we propose LTS-P, a novel technique that leverages compressed singular value decomposition (SVD) on the ''lifted'' Hankel matrix to estimate the unstable subsystem up to an unknown transformation. Then, we design a stabilizing controller that integrates a robust stabilizing controller for the unstable mode and a small-gain-type assumption on the stable subspace. We show that LTS-P stabilizes unknown partially observable LTI systems with state-of-the-art sample complexity that is dimension-free and only scales with the number of unstable modes, which significantly reduces data requirements for high-dimensional systems with many stable modes.
