Reinforcement Learning-based Relay Selection for Cooperative WSNs in the Presence of Bursty Impulsive Noise
Hazem Barka, Md Sahabul Alam, Georges Kaddoum, Minh Au, Basile L. Agba
TL;DR
The paper tackles relay selection in cooperative WSNs with Decode-and-Forward relays under bursty impulsive noise with memory modeled as a Two-state Markov Gaussian process, focusing on Symbol-Error-Rate and battery fairness. It introduces two strategies: (i) a noise-state-aware Max-Min criterion augmented with residual battery metrics to improve SER and balance energy consumption, and (ii) a reinforcement-learning approach using policy-gradient (REINFORCE) to learn relay selection in an MDP where states encode channel gains, impulse activity, and battery levels. Experimental results show the modified Max-Min can approach AWGN-like SER while achieving near-equal remaining energy among relays, and the RL method can match AWGN/TSMG benchmarks and exploit impulsive-noise patterns for improved performance and fairness. The methods are applicable to multi-relay scenarios and can accommodate additional WSN constraints, enhancing reliability and energy efficiency in practical smart-grid-like environments.
Abstract
The problem of relay selection is pivotal in the realm of cooperative communication. However, this issue has not been thoroughly examined, particularly when the background noise is assumed to possess an impulsive characteristic with consistent memory as observed in smart grid communications and some other wireless communication scenarios. In this paper, we investigate the impact of this specific type of noise on the performance of cooperative Wireless Sensor Networks (WSNs) with the Decode and Forward (DF) relaying scheme, considering Symbol-Error-Rate (SER) and battery power consumption fairness across all nodes as the performance metrics. We introduce two innovative relay selection methods that depend on noise state detection and the residual battery power of each relay. The first method encompasses the adaptation of the Max-Min criterion to this specific context, whereas the second employs Reinforcement Learning (RL) to surmount this challenge. Our empirical outcomes demonstrate that the impacts of bursty impulsive noise on the SER performance can be effectively mitigated and that a balance in battery power consumption among all nodes can be established using the proposed methods.
