Secure Energy Efficient Wireless Transmission: A Finite v/s Infinite-Horizon RL Solution
Shalini Tripathi, Ankur Bansal, Holger Claussen, Lester Ho, Chinmoy Kundu
TL;DR
This work tackles secure wireless transmission with energy harvesting at both the source and a full-duplex destination, aiming to maximize the average secrecy energy efficiency over a finite horizon. It casts the joint transmit and jamming power allocation as a finite-horizon MDP and develops FHJPA, a backward-induction algorithm, along with a low-complexity greedy baseline and an infinite-horizon IHJPA for comparison. The results show FHJPA delivers the best average SEE across a range of horizons, with IHJPA approaching FHJPA as the horizon grows, while GA becomes competitive when the source battery is energy-rich. The study highlights a fundamental trade-off between maximizing SEE and total secure bits, and reports a 16.6% reduction in computation time for FHJPA relative to IHJPA, underscoring practical scalability of the finite-horizon approach.
Abstract
In this paper, a joint optimal allocation of transmit power at the source and jamming power at the destination is proposed to maximize the average secrecy energy efficiency (SEE) of a wireless network within a finite time duration. The destination transmits the jamming signal to improve secrecy by utilizing full-duplex capability. The source and destination both have energy harvesting (EH) capability with limited battery capacity. Due to the Markov nature of the system, the problem is formulated as a finite-horizon reinforcement learning (RL) problem. We propose the finite-horizon joint power allocation (FHJPA) algorithm for the finite-horizon RL problem and compare it with a low-complexity greedy algorithm (GA). An infinite-horizon joint power allocation (IHJPA) algorithm is also proposed for the corresponding infinite-horizon problem. A comparative analysis of these algorithms is carried out in terms of SEE, expected total transmitted secure bits, and computational complexity. The results show that the FHJPA algorithm outperforms the GA and IHJPA algorithms due to its appropriate modelling in finite horizon transmission. When the source node battery has sufficient energy, the GA can yield performance close to the FHJPA algorithm despite its low-complexity. When the transmission time horizon increases, the accuracy of the infinite-horizon model improves, resulting in a reduced performance gap between FHJPA and IHJPA algorithms. The computational time comparison shows that the FHJPA algorithm takes $16.6$ percent less time than the IHJPA algorithm.
