Stability Analysis of Deep Reinforcement Learning for Multi-Agent Inspection in a Terrestrial Testbed
Henry Lei, Zachary S. Lippay, Anonto Zaman, Joshua Aurand, Amin Maghareh, Sean Phillips
TL;DR
The paper addresses robust autonomous multi-agent satellite inspection under modeling uncertainties and limited communications. It proposes a hierarchical DRL framework with a high-level guidance planner and a low-level motion controller, augmented by runtime assurance using barrier-based methods, evaluated on the LINCS platform across four fidelity levels. Key contributions include a formal problem formulation with $CWH$ dynamics and $J_2$ perturbations, an integrated RL-based guidance and control architecture, and comprehensive experiments showing high task completion rates with controlled degradation in time and distance as fidelity increases. These results demonstrate the potential for scalable, robust autonomous satellite operations and provide a practical path for bridging the sim-to-real gap in space missions.
Abstract
The design and deployment of autonomous systems for space missions require robust solutions to navigate strict reliability constraints, extended operational duration, and communication challenges. This study evaluates the stability and performance of a hierarchical deep reinforcement learning (DRL) framework designed for multi-agent satellite inspection tasks. The proposed framework integrates a high-level guidance policy with a low-level motion controller, enabling scalable task allocation and efficient trajectory execution. Experiments conducted on the Local Intelligent Network of Collaborative Satellites (LINCS) testbed assess the framework's performance under varying levels of fidelity, from simulated environments to a cyber-physical testbed. Key metrics, including task completion rate, distance traveled, and fuel consumption, highlight the framework's robustness and adaptability despite real-world uncertainties such as sensor noise, dynamic perturbations, and runtime assurance (RTA) constraints. The results demonstrate that the hierarchical controller effectively bridges the sim-to-real gap, maintaining high task completion rates while adapting to the complexities of real-world environments. These findings validate the framework's potential for enabling autonomous satellite operations in future space missions.
