Adversarially Robust Multitask Adaptive Control
Kasra Fallah, Leonardo F. Toso, James Anderson
TL;DR
The paper tackles adversarially robust multitask adaptive LQR control for heterogeneous, potentially compromised systems. It introduces a clustered multitask learning pipeline that clusters systems, performs robust multitask system identification within clusters, and uses resilient aggregation to suppress Byzantine updates. The authors derive non-asymptotic error and regret bounds showing that collaboration reduces regret roughly as rac{dT}{m_j}^{1/2} and that this benefit persists under bounded adversarial fractions, with additional terms capturing clustering misclassification and intra-cluster heterogeneity. Numerical experiments on simulated multi-robot dynamics validate the approach, illustrating improved performance over single-task CE control and robustness to adversarial behavior, with insights into the interaction between clustering accuracy, heterogeneity, and resilience.
Abstract
We study adversarially robust multitask adaptive linear quadratic control; a setting where multiple systems collaboratively learn control policies under model uncertainty and adversarial corruption. We propose a clustered multitask approach that integrates clustering and system identification with resilient aggregation to mitigate corrupted model updates. Our analysis characterizes how clustering accuracy, intra-cluster heterogeneity, and adversarial behavior affect the expected regret of certainty-equivalent (CE) control across LQR tasks. We establish non-asymptotic bounds demonstrating that the regret decreases inversely with the number of honest systems per cluster and that this reduction is preserved under a bounded fraction of adversarial systems within each cluster.
