Securing SIM-Assisted Wireless Networks via Quantum Reinforcement Learning

Le-Hung Hoang; Quang-Trung Luu; Dinh Thai Hoang; Diep N. Nguyen; Van-Dinh Nguyen

Securing SIM-Assisted Wireless Networks via Quantum Reinforcement Learning

Le-Hung Hoang, Quang-Trung Luu, Dinh Thai Hoang, Diep N. Nguyen, Van-Dinh Nguyen

TL;DR

A hybrid quantum proximal policy optimization (Q-PPO) framework for SIM-assisted secure communications is proposed, which jointly optimizes transmit power allocation and SIM phase shifts to maximize the average secrecy rate under power and quality-of-service constraints.

Abstract

Stacked intelligent metasurfaces (SIMs) have recently emerged as a powerful wave-domain technology that enables multi-stage manipulation of electromagnetic signals through multilayer programmable architectures. While SIMs offer unprecedented degrees of freedom for enhancing physical-layer security, their extremely large number of meta-atoms leads to a high-dimensional and strongly coupled optimization space, making conventional design approaches inefficient and difficult to scale. Moreover, existing deep reinforcement learning (DRL) techniques suffer from slow convergence and performance degradation in dynamic wireless environments with imperfect knowledge of passive eavesdroppers. To overcome these challenges, we propose a hybrid quantum proximal policy optimization (Q-PPO) framework for SIM-assisted secure communications, which jointly optimizes transmit power allocation and SIM phase shifts to maximize the average secrecy rate under power and quality-of-service constraints. Specifically, a parameterized quantum circuit is embedded into the actor network, forming a hybrid classical-quantum policy architecture that enhances policy representation capability and exploration efficiency in high-dimensional continuous action spaces. Extensive simulations demonstrate that the proposed Q-PPO scheme consistently outperforms DRL baselines, achieving approximately 15% higher secrecy rates and 30% faster convergence under imperfect eavesdropper channel state information. These results establish Q-PPO as a powerful optimization paradigm for SIM-enabled secure wireless networks.

Securing SIM-Assisted Wireless Networks via Quantum Reinforcement Learning

TL;DR

Abstract

Paper Structure (22 sections, 41 equations, 6 figures, 1 table, 1 algorithm)

This paper contains 22 sections, 41 equations, 6 figures, 1 table, 1 algorithm.

Introduction
Related Works
Motivations and Main Contributions
System Model and Problem Formulation
SIM Model
Channel Model
Problem Formulation
Stochastic Optimization Problem Formulation
Quantum Reinforcement Learning-based Solution
Preliminaries of Reinforcement Learning
Proposed Q-PPO Algorithm
Quantum Computing-enhanced Reinforcement Learning
Parameterized Quantum Circuit
Quantum Measurement
Pre-Encoding and Post-Processing
...and 7 more sections

Figures (6)

Figure 1: The SIM-aided secure multi-user communication system model.
Figure 2: The proposed hybrid quantum-classical Q-PPO framework.
Figure 3: The convergence of the different algorithms.
Figure 4: Comparison of proposed Q-PPO and benchmark schemes (PPO and Random), where we consider varying (a) learning rate $lr$, (b) number of available qubits $q$ and (c) number of PQC layers $\eta$.
Figure 5: Evaluation of the average secrecy rate with varying (a) number of meta atoms per layers ($N$), (b) number of SIM layers ($L$), (c) maximum transmit power ($P_0$), (d) minimum data rate requirement ($R_{\min}$), (e) number of CUs ($M$), and (f) CUs/Eva distance ($d_{\epsilon}$).
...and 1 more figures

Securing SIM-Assisted Wireless Networks via Quantum Reinforcement Learning

TL;DR

Abstract

Securing SIM-Assisted Wireless Networks via Quantum Reinforcement Learning

Authors

TL;DR

Abstract

Table of Contents

Figures (6)