Close-Proximity Satellite Operations through Deep Reinforcement Learning and Terrestrial Testing Environments
Henry Lei, Joshua Aurand, Zachary S. Lippay, Sean Phillips
TL;DR
The paper addresses safe autonomous satellite operations in crowded space by deploying a DRL-based low-level controller within a hierarchical framework and evaluating safety via a runtime assurance (RTA) module. It uses the LINCS Lab to compare training, simulation, and hardware-in-the-loop performance, highlighting sim-to-real gaps and the impact of RTA on mission efficiency. Across three experimental sets, results show strong baseline performance in simulation but notable degradation in hardware environments, especially under high-risk, close-proximity scenarios. The work emphasizes the importance of robust training, realistic dynamics, and safety-first control architectures for practical autonomous satellite operations, guiding future improvements in sim2real robustness and safety integration.
Abstract
With the increasingly congested and contested space environment, safe and effective satellite operation has become increasingly challenging. As a result, there is growing interest in autonomous satellite capabilities, with common machine learning techniques gaining attention for their potential to address complex decision-making in the space domain. However, the "black-box" nature of many of these methods results in difficulty understanding the model's input/output relationship and more specifically its sensitivity to environmental disturbances, sensor noise, and control intervention. This paper explores the use of Deep Reinforcement Learning (DRL) for satellite control in multi-agent inspection tasks. The Local Intelligent Network of Collaborative Satellites (LINCS) Lab is used to test the performance of these control algorithms across different environments, from simulations to real-world quadrotor UAV hardware, with a particular focus on understanding their behavior and potential degradation in performance when deployed beyond the training environment.
