DRL-Based Beam Positioning for LEO Satellite Constellations with Weighted Least Squares
Po-Heng Chou, Chiapin Wang, Kuan-Hao Chen, Wei-Chen Hsiao
TL;DR
This work tackles real-time, CSI-free positioning in dynamic multi-beam LEO satellite systems by coupling a reinforcement-learning policy with an augmented weighted least squares estimator. The proposed DQN-WLS framework learns beam weighting directly from uplink pilot responses and beam geometry, enabling robust localization under interference and GNSS-denied conditions while maintaining low computational complexity. Empirical results show a 99.3% reduction in mean positioning error relative to a geometry-based baseline, achieving a 0.395 m RMSE with near real-time inference, demonstrating strong potential for onboard, autonomous beam management in NTN/LEO deployments.
Abstract
In this paper, we propose a reinforcement learning based beam weighting framework that couples a policy network with an augmented weighted least squares (WLS) estimator for accurate and low-complexity positioning in multi-beam LEO constellations. Unlike conventional geometry or CSI-dependent approaches, the policy learns directly from uplink pilot responses and geometry features, enabling robust localization without explicit CSI estimation. An augmented WLS jointly estimates position and receiver clock bias, improving numerical stability under dynamic beam geometry. Across representative scenarios, the proposed method reduces the mean positioning error by 99.3% compared with the geometry-based baseline, achieving 0.395 m RMSE with near real-time inference.
