Deep Reinforcement Learning for Interference Suppression in RIS-Aided Space-Air-Ground Integrated Networks
Pujitha Mamillapalli, Shikhar Verma, Tiago Koketsu Rodrigues, Abhinav Kumar
TL;DR
The paper tackles cross-tier interference in RIS-aided SAGINs under shared-spectrum operation, proposing a RIS-assisted HAPS framework and a Deep Deterministic Policy Gradient (DDPG) approach to learn beamforming that forms nulls toward interference while preserving QoS. By modeling the system as a continuous-control task, the authors train actor–critic networks to adjust the HAPS beamforming and RIS phases in response to rapidly changing channels, achieving up to $11.3\%$ throughput gains for a $4\times4$ RIS over conventional zero-forcing beamforming. The results demonstrate improved spectral efficiency and energy usage in dynamic non-terrestrial networks across different RIS configurations and user distributions, validating the adaptability of DRL-based interference suppression. The work suggests scalable deployment potential and outlines future directions addressing larger networks, hardware impairments, and imperfect CSI to bridge toward real-world non-terrestrial 6G systems.
Abstract
Future 6G networks are envisioned to provide ubiquitous connectivity through space-air-ground integrated networks (SAGINs), where high-altitude platform stations (HAPSs) and satellites complement terrestrial systems to provide wide-area, low-latency coverage. However, the rapid growth of terrestrial devices intensifies spectrum sharing between terrestrial and non-terrestrial segments, resulting in severe cross-tier interference. In particular, frequency sharing between the HAPS-satellite uplink and the HAPS-ground downlink improves spectral efficiency but suffers from interference caused by the HAPS antenna back-lobe. Existing approaches relying on zero-forcing (ZF) codebooks have limited performance under highly dynamic channel conditions. To overcome this limitation, we employ a reconfigurable intelligent surface (RIS)-aided HAPS-based SAGIN framework with a deep deterministic policy gradient (DDPG) algorithm. The proposed DDPG framework optimizes the HAPS beamforming weights to form spatial nulls toward interference sources while maintaining robust links for the desired signals. Simulation results demonstrate that the DDPG framework consistently outperforms conventional ZF beamforming across different RIS configurations, achieving up to \(11.3\%\) throughput improvement for a \(4\times4\) RIS configuration, which validates its adaptive capability to enhance spectral efficiency in dynamic HAPS-based SAGINs.
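To make the idea of forming a spatial null concrete, the following is a minimal sketch of generic zero-forcing null steering, the kind of baseline the abstract compares against. It is not the authors' codebook design or their DDPG agent. It assumes a uniform linear array with half-wavelength spacing, a single interferer, and perfect channel knowledge; the function names `steering_vector` and `zf_null_steering` and the chosen angles are hypothetical and used only for illustration.

```python
import numpy as np


def steering_vector(n_antennas, angle_rad, spacing=0.5):
    """Array response of a uniform linear array (assumed geometry).

    `spacing` is the element spacing in wavelengths.
    """
    k = np.arange(n_antennas)
    return np.exp(1j * 2 * np.pi * spacing * k * np.sin(angle_rad))


def zf_null_steering(h_desired, h_interf):
    """Unit-norm weights with exact nulls toward the interferers.

    h_desired: (N,) complex channel of the desired link.
    h_interf:  (N, K) complex interference channels, with K < N.
    The desired channel is projected onto the orthogonal complement
    of the interference subspace, so that w^H h_interf = 0.
    """
    n = h_desired.shape[0]
    # H @ pinv(H) is the orthogonal projector onto the column space of H.
    projector = np.eye(n) - h_interf @ np.linalg.pinv(h_interf)
    w = projector @ h_desired
    return w / np.linalg.norm(w)


# Illustrative scenario: 8 antennas, desired signal at 10 degrees,
# interferer at -40 degrees (all values are assumptions).
n_antennas = 8
h_d = steering_vector(n_antennas, np.deg2rad(10.0))
h_i = steering_vector(n_antennas, np.deg2rad(-40.0))[:, None]

w = zf_null_steering(h_d, h_i)

# np.vdot conjugates its first argument, giving w^H h.
gain_desired = abs(np.vdot(w, h_d))
gain_interf = abs(np.vdot(w, h_i[:, 0]))
print(f"desired gain: {gain_desired:.3f}, interference gain: {gain_interf:.2e}")
```

In this sketch the null is exact only because the interference channel is known perfectly and held fixed. When the channel changes between estimation and transmission, the weights are stale and the null degrades, which is the limitation under dynamic channels that motivates learning the weights with DDPG instead.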
