On the Necessity of Two-Stage Estimation for Learning Dynamical Systems under Both Noise and Node-Wise Attacks
Jihun Kim, Javad Lavaei
TL;DR
The paper addresses learning a networked linear dynamical system from a single trajectory under persistent noise and node-wise adversarial attacks with probability $p<0.5$. It proves that any convex one-stage estimator cannot consistently recover the true dynamics $ar{A}$ in this setting, and it introduces a robust two-stage approach: Stage I uses a row-wise $ oldsymbol{ ext{l}}_{1}$-norm estimator to detect and filter attacked samples, followed by Stage II least-squares on the cleaned data. The authors derive non-asymptotic bounds showing Stage I error scales with $( obreak \sigma_w+ obreak obreak obreak obreak obreak obreak obreak )^4 obreak $ and the Stage II error decays as $Oigl(rac{n}{T}igr)^{1/2} rac{ obreak ( }{ } $ plus a bias term due to misclassifications, with perfect separability ($v_t^{(i)}$ large relative to noise) yielding consistency. The work thus provides the first non-asymptotic guarantees that a two-stage estimation framework is necessary for robust system identification under simultaneous persistent noise and adversarial attacks, with practical implications for filtering-based robustness in distributed networks.
Abstract
The least-squares estimator has achieved considerable success in learning linear dynamical systems from a single trajectory of length $T$. While it attains an optimal error of $\mathcal{O}(1/\sqrt{T})$ under independent zero-mean noise, it lacks robustness and is particularly susceptible to adversarial corruption. In this paper, we consider the identification of a networked system in which every node is subject to both noise and adversarial attacks. We assume that every node is independently corrupted with probability smaller than $0.5$ at each time, placing the overall system under almost-persistent local attack. We first show that no convex one-stage estimator can achieve a consistent estimate as $T$ grows under both noise and attacks. This motivates the development of a two-stage estimation method applied across nodes. In Stage I, we leverage the $\ell_1$-norm estimator and derive an estimation error bound proportional to the noise level $σ_w$. This bound is subsequently used to detect and filter out attacks, producing a clean dataset for each node, to which we apply the least-squares estimator in Stage II. The resulting estimation error is on the order $\mathcal{O}(1/\sqrt{T})$ plus the product of $σ_w$ and the number of misclassifications. In the event of perfect separability between attack and non-attack data, which occurs when injected attacks are sufficiently large relative to the noise scale, our two-stage estimator is consistent for the true system.
