AutoBS: Autonomous Base Station Deployment with Reinforcement Learning and Digital Network Twins
Ju-Hyung Lee, Andreas F. Molisch
TL;DR
AutoBS presents a PPO-based deep reinforcement learning framework integrated with a PMNet digital network twin to autonomously deploy base stations in 6G RANs. By modeling deployment as an MDP and using fast pathloss predictions for reward evaluation, AutoBS achieves near-optimal capacity (up to ~95% of exhaustive search) with inference times in milliseconds, enabling real-time, scalable optimization for dense urban networks. The approach supports both static single-BS and asynchronous multi-BS deployment, demonstrating strong performance gains over heuristic baselines and substantial reductions in computation time. This work offers a practical, adaptable framework for large-scale network topology optimization and can extend to related tasks such as mobility management and energy-efficient BS operation.
Abstract
This paper introduces AutoBS, a reinforcement learning (RL)-based framework for optimal base station (BS) deployment in 6G radio access networks (RAN). AutoBS leverages the Proximal Policy Optimization (PPO) algorithm and fast, site-specific pathloss predictions from PMNet-a generative model for digital network twins (DNT). By efficiently learning deployment strategies that balance coverage and capacity, AutoBS achieves about 95% of the capacity of exhaustive search in single BS scenarios (and in 90% for multiple BSs), while cutting inference time from hours to milliseconds, making it highly suitable for real-time applications (e.g., ad-hoc deployments). AutoBS therefore provides a scalable, automated solution for large-scale 6G networks, meeting the demands of dynamic environments with minimal computational overhead.
