Sensing-Assisted Adaptive Channel Contention for Mobile Delay-Sensitive Communications
Bojie Lv, Qianren Li, Rui Wang
TL;DR
This work tackles delay-sensitive uplink scheduling in sensing-enabled mmWave networks by formulating a decentralized multi-agent MDP in which mobile agents locally map their state to back-off parameters. A centralized optimization of policy parameters $\mathbf{b}$ and $\bm{\lambda}$ is solved using an unbiased SGD gradient estimator that leverages environment and mobility sensing, including LoS blockage predictions, to minimize a discounted queueing cost. Key contributions include the P1 formulation, an analytical back-off policy model, unbiased gradient estimation via a single trajectory, a provable convergence guarantee for the SLPG algorithm, and simulations showing superior queuing performance and efficiency over SPSA and baseline policies. The approach delivers practical gains in reduced delays and buffer overflow in mobile mmWave uplink scenarios, by integrating sensing-based channel predictions into adaptive contention and scheduling.
Abstract
This paper proposes an adaptive channel contention mechanism to optimize the queuing performance of a distributed millimeter wave (mmWave) uplink system with the capability of environment and mobility sensing. The mobile agents determine their back-off timer parameters according to their local knowledge of the uplink queue lengths, channel quality, and future channel statistics, where the channel prediction relies on the environment and mobility sensing. The optimization of queuing performance with this adaptive channel contention mechanism is formulated as a decentralized multi-agent Markov decision process (MDP). Although the channel contention actions are determined locally at the mobile agents, the optimization of local channel contention policies of all mobile agents is conducted in a centralized manner according to the system statistics before the scheduling. In the solution, the local policies are approximated by analytical models, and the optimization of their parameters becomes a stochastic optimization problem along an adaptive Markov chain. An unbiased gradient estimation is proposed so that the local policies can be optimized efficiently via the stochastic gradient descent method. It is demonstrated by simulation that the proposed gradient estimation is significantly more efficient in optimization than the existing methods, e.g., simultaneous perturbation stochastic approximation (SPSA).
