Robust SAC-Enabled UAV-RIS Assisted Secure MISO Systems With Untrusted EH Receivers
Hamid Reza Hashempour, Le-Nam Tran, Duy H. N. Nguyen, Hien Quoc Ngo
TL;DR
This work addresses secure downlink transmission in a UAV-mounted RIS-assisted multiuser MISO system with untrusted energy-harvesting receivers, formulating a worst-case secrecy-energy efficiency (WCSEE) objective under imperfect CSI and discrete RIS phases. A soft actor-critic (SAC) framework is proposed to jointly optimize UAV position, RIS phase shifts, and ZF power allocation, showing superior robustness to CSI uncertainty and faster convergence than baselines. As a benchmark, a tailored SCA-based solution is developed for the ideal PCSI case with continuous RIS phases, providing analytical insights and a quantitative performance reference. Results indicate the SAC approach yields notable WCSEE gains (up to ~28% over SCA and ~16% over DDPG in ideal scenarios) and robust performance across transmit-power levels and RIS sizes, underscoring the potential of learning-based strategies for complex, nonconvex UAV-RIS security problems. The work paves the way for practical secure UAV-RIS designs and motivates future extensions to multi-UAV systems and time-varying environments.
Abstract
This paper investigates secure downlink transmission in a UAV-assisted reconfigurable intelligent surface (RIS)-enabled multiuser multiple-input single-output network, where legitimate information-harvesting receivers coexist with untrusted energy-harvesting receivers (UEHRs) capable of eavesdropping. A UAV-mounted RIS provides blockage mitigation and passive beamforming, while the base station employs zero-forcing precoding for multiuser interference suppression. Due to limited feedback from UEHRs, their channel state information (CSI) is imperfect, leading to a worst-case secrecy energy efficiency (WCSEE) maximization problem. We jointly optimize the UAV horizontal position, RIS phase shifts, and transmit power allocation under both perfect and imperfect CSI, considering discrete RIS phases, UAV mobility, and energy-harvesting constraints. The resulting problem is highly nonconvex due to coupled channel geometry, robustness requirements, and discrete variables. To address this challenge, we propose a soft actor-critic (SAC)-based deep reinforcement learning framework that learns WCSEE-maximizing policies through interaction with the wireless environment. As a structured benchmark, a successive convex approximation (SCA) approach is developed for the perfect CSI case with continuous RIS phases. Simulation results show that the proposed SAC method achieves up to 28% and 16% secrecy energy efficiency gains over SCA and deep deterministic policy gradient baselines, respectively, while demonstrating superior robustness to CSI uncertainty and stable performance across varying transmit power levels and RIS sizes.
