Robust synchronization and policy adaptation for networked heterogeneous agents
Miguel F. Arevalo-Castiblanco, Eduardo Mojica-Nava and, César A. Uribe
TL;DR
This paper tackles robust synchronization of leader–follower networks composed of nonlinear heterogeneous agents in the presence of model uncertainties and actuator saturation. It introduces DMSAC-RL, which augments pre-trained RL policies with a distributed adaptive inner loop to handle mismatch and saturations, and extends it to followers via a distributed MRAC framework. Lyapunov-based analysis proves Uniformly Ultimately Bounded ($UUB$) synchronization errors for both leader and follower dynamics in MIMO settings, with input magnitude saturation addressed in an augmented control law. Numerical experiments on pendulum networks and saturated linear MIMO systems demonstrate improved robustness and synchronization compared with RL-alone policies, highlighting practical impact for data-driven control in heterogeneous multi-agent systems.
Abstract
We propose a robust adaptive online synchronization method for leader-follower networks of nonlinear heterogeneous agents with system uncertainties and input magnitude saturation. Synchronization is achieved using a Distributed input Magnitude Saturation Adaptive Control with Reinforcement Learning (DMSAC-RL), which improves the empirical performance of policies trained on off-the-shelf models using Reinforcement Learning (RL) strategies. The leader observes the performance of a reference model, and followers observe the states and actions of the agents they are connected to, but not the reference model. The leader and followers may differ from the reference model in which the RL control policy was trained. DMSAC-RL uses an internal loop that adjusts the learned policy for the agents in the form of augmented input to solve the distributed control problem, including input-matched uncertainty parameters. We show that the synchronization error of the heterogeneous network is Uniformly Ultimately Bounded (UUB). Numerical analysis of a network of Multiple Input Multiple Output (MIMO) systems supports our theoretical findings.
