Decentralized Event-Triggered Online Learning for Safe Consensus of Multi-Agent Systems with Gaussian Process Regression
Xiaobing Dai, Zewen Yang, Mengtian Xu, Fangzhou Liu, Georges Hattab, Sandra Hirche
TL;DR
The paper tackles average consensus in multi-agent systems with unknown dynamics by integrating a distributed controller with an auxiliary dynamics and Gaussian Process-based online learning to compensate the unknown term $f(\cdot)$. It introduces a decentralized event-triggered mechanism that selectively augments the GP training data, yielding probabilistic bounds on prediction errors and stability via a Lyapunov framework. The main theoretical result proves a probabilistic $\epsilon$-average consensus with $\epsilon = 2 c^{-1} N \underline{\eta}$ and high-confidence guarantees, while simulations and Monte Carlo tests show improved performance and data efficiency over offline learning. The approach offers a safe, scalable solution for learning-based coordination in MAS under uncertainties, with tunable speed and accuracy through separate control gains and learning parameters.
Abstract
Consensus control in multi-agent systems has received significant attention and practical implementation across various domains. However, managing consensus control under unknown dynamics remains a significant challenge for control design due to system uncertainties and environmental disturbances. This paper presents a novel learning-based distributed control law, augmented by an auxiliary dynamics. Gaussian processes are harnessed to compensate for the unknown components of the multi-agent system. For continuous enhancement in predictive performance of Gaussian process model, a data-efficient online learning strategy with a decentralized event-triggered mechanism is proposed. Furthermore, the control performance of the proposed approach is ensured via the Lyapunov theory, based on a probabilistic guarantee for prediction error bounds. To demonstrate the efficacy of the proposed learning-based controller, a comparative analysis is conducted, contrasting it with both conventional distributed control laws and offline learning methodologies.
