Robust SAC-Enabled UAV-RIS Assisted Secure MISO Systems With Untrusted EH Receivers

Hamid Reza Hashempour; Le-Nam Tran; Duy H. N. Nguyen; Hien Quoc Ngo

Robust SAC-Enabled UAV-RIS Assisted Secure MISO Systems With Untrusted EH Receivers

Hamid Reza Hashempour, Le-Nam Tran, Duy H. N. Nguyen, Hien Quoc Ngo

TL;DR

This work addresses secure downlink transmission in a UAV-mounted RIS-assisted multiuser MISO system with untrusted energy-harvesting receivers, formulating a worst-case secrecy-energy efficiency (WCSEE) objective under imperfect CSI and discrete RIS phases. A soft actor-critic (SAC) framework is proposed to jointly optimize UAV position, RIS phase shifts, and ZF power allocation, showing superior robustness to CSI uncertainty and faster convergence than baselines. As a benchmark, a tailored SCA-based solution is developed for the ideal PCSI case with continuous RIS phases, providing analytical insights and a quantitative performance reference. Results indicate the SAC approach yields notable WCSEE gains (up to ~28% over SCA and ~16% over DDPG in ideal scenarios) and robust performance across transmit-power levels and RIS sizes, underscoring the potential of learning-based strategies for complex, nonconvex UAV-RIS security problems. The work paves the way for practical secure UAV-RIS designs and motivates future extensions to multi-UAV systems and time-varying environments.

Abstract

This paper investigates secure downlink transmission in a UAV-assisted reconfigurable intelligent surface (RIS)-enabled multiuser multiple-input single-output network, where legitimate information-harvesting receivers coexist with untrusted energy-harvesting receivers (UEHRs) capable of eavesdropping. A UAV-mounted RIS provides blockage mitigation and passive beamforming, while the base station employs zero-forcing precoding for multiuser interference suppression. Due to limited feedback from UEHRs, their channel state information (CSI) is imperfect, leading to a worst-case secrecy energy efficiency (WCSEE) maximization problem. We jointly optimize the UAV horizontal position, RIS phase shifts, and transmit power allocation under both perfect and imperfect CSI, considering discrete RIS phases, UAV mobility, and energy-harvesting constraints. The resulting problem is highly nonconvex due to coupled channel geometry, robustness requirements, and discrete variables. To address this challenge, we propose a soft actor-critic (SAC)-based deep reinforcement learning framework that learns WCSEE-maximizing policies through interaction with the wireless environment. As a structured benchmark, a successive convex approximation (SCA) approach is developed for the perfect CSI case with continuous RIS phases. Simulation results show that the proposed SAC method achieves up to 28% and 16% secrecy energy efficiency gains over SCA and deep deterministic policy gradient baselines, respectively, while demonstrating superior robustness to CSI uncertainty and stable performance across varying transmit power levels and RIS sizes.

Robust SAC-Enabled UAV-RIS Assisted Secure MISO Systems With Untrusted EH Receivers

TL;DR

Abstract

Paper Structure (25 sections, 3 theorems, 84 equations, 7 figures, 1 table, 5 algorithms)

This paper contains 25 sections, 3 theorems, 84 equations, 7 figures, 1 table, 5 algorithms.

Introduction
System Model and Problem Formulation
Channel Model
Signal Model and ZF Beamforming
Worst-Case Secrecy Rate Under CSI Uncertainty
Problem Formulation
SAC-Based Joint ZF Power Allocation, RIS Phase, and Location Optimization
MDP Model
Fundamentals of the SAC Algorithm
SAC-Based Joint Optimization Framework
SCA-Based Solution
Power Allocation
RIS Reflection Phase Optimization
UAV Location Optimization
Channel Separation and SINR Representation
...and 10 more sections

Key Result

Proposition 1

An affine approximation of the secrecy-related constraints in Pris_sec, for all $k\in\mathcal{K}$ and $j\in\mathcal{J}$, is given by where $\Psi^{(t)}(\mathbf{s},\rho_k;\mathbf{t}_{kk})$ is the first-order Taylor approximation of $\frac{|\mathbf{t}_{kk}^H\mathbf{s}|^2}{\rho_k}$, $\Gamma^{(t)}(x)\triangleq 2^{x^{(t-1)}}[1+\ln(2)\,(x-x^{(t-1)})]$ is the first-order Taylor approximation of $2^x$, an

Figures (7)

Figure 1: System model of the UAV-assisted RIS-aided secure downlink network.
Figure 2: SAC framework for joint power allocation, RIS phase optimization, and UAV positioning with twin critics and target Q-networks.
Figure 3: Convergence behavior under the ideal setting at $P_{\mathrm{max}}=10$ dBm.
Figure 4: SEE versus transmit power for SCA, DDPG, and the proposed SAC-based scheme.
Figure 5: SEE versus the number of RIS elements $M$ at $P_{\mathrm{max}}=10$ dBm for different schemes.
...and 2 more figures

Theorems & Definitions (8)

Remark 1
Proposition 1
proof
Proposition 2
proof
Remark 2
Lemma 1
proof

Robust SAC-Enabled UAV-RIS Assisted Secure MISO Systems With Untrusted EH Receivers

TL;DR

Abstract

Robust SAC-Enabled UAV-RIS Assisted Secure MISO Systems With Untrusted EH Receivers

Authors

TL;DR

Abstract

Table of Contents

Key Result

Figures (7)

Theorems & Definitions (8)