Noise Sensitivity of the Semidefinite Programs for Direct Data-Driven LQR
Xiong Zeng, Laurent Bako, Necmiye Ozay
TL;DR
The paper reveals a fundamental instability in direct data-driven LQR formulations: when data are contaminated by noise, the certaintyequivalence SDP and its robustness-promoting variant both yield trivial zero-gain controllers in the large-sample limit, undermining statistical consistency. By reformulating the CE and RP problems and applying the Fundamental Lemma with persistently exciting data, the authors show that the optimal solutions satisfy linear relations that force the feedback gain to vanish with probability one in the presence of noise. Numerical experiments on a second-order unstable system corroborate the theory, showing zero gains for CE under noise and similar asymptotic behavior for RP with fixed regularization, though increasing regularization with data length can prevent collapse. The work underscores the need for truly robust, statistically sound direct data-driven control methods beyond naive CE or fixed-regularization RP formulations, with implications for the design and analysis of data-driven controllers in noisy environments.
Abstract
In this paper, we study the noise sensitivity of the semidefinite program (SDP) proposed for direct data-driven infinite-horizon linear quadratic regulator (LQR) problem for discrete-time linear time-invariant systems. While this SDP is shown to find the true LQR controller in the noise-free setting, we show that it leads to a trivial solution with zero gain matrices when data is corrupted by noise, even when the noise is arbitrarily small. We then study a variant of the SDP that includes a robustness promoting regularization term and prove that regularization does not fully eliminate the sensitivity issue. In particular, the solution of the regularized SDP converges in probability also to a trivial solution.
