An open source Multi-Agent Deep Reinforcement Learning Routing Simulator for satellite networks
Federico Lozano-Cuadra, Mathias D. Thorsager, Israel Leyva-Mayorga, Beatriz Soret
TL;DR
This work tackles robust packet routing in Low Earth Orbit satellite constellations under movement and traffic uncertainties. It introduces an open-source Python-based simulator with a discrete-event engine that supports Dijkstra-based routing, Q-Routing, and MA-DRL on a time-varying graph $\mathcal{G}_t(\mathcal{N}, \mathcal{E})$. The platform offers configurable constellations (including Walker delta/star designs and real deployments like Kepler, Iridium Next, OneWeb, and Starlink), realistic data-rate models, and visualization/post-processing tools. Experiments show RL-based routing reduces end-to-end latency relative to traditional shortest-path policies, demonstrating practical impact for future satellite networks.
Abstract
This paper introduces an open source simulator for packet routing in Low Earth Orbit Satellite Constellations (LSatCs) considering the dynamic system uncertainties. The simulator, implemented in Python, supports traditional Dijkstra's based routing as well as more advanced learning solutions, specifically Q-Routing and Multi-Agent Deep Reinforcement Learning (MA-DRL) from our previous work. It uses an event-based approach with the SimPy module to accurately simulate packet creation, routing and queuing, providing real-time tracking of queues and latency. The simulator is highly configurable, allowing adjustments in routing policies, traffic, ground and space layer topologies, communication parameters, and learning hyperparameters. Key features include the ability to visualize system motion and track packet paths. Results highlight significant improvements in end-to-end (E2E) latency using Reinforcement Learning (RL)-based routing policies compared to traditional methods. The source code, the documentation and a Jupyter notebook with post-processing results and analysis are available on GitHub.
