Lyapunov-guided Multi-Agent Reinforcement Learning for Delay-Sensitive Wireless Scheduling

Cheng Zhang; Lan Wei; Ji Fan; Zening Liu; Yongming Huang

TL;DR

A two-stage intelligent scheduler is proposed to minimize packet-level delay jitter while guaranteeing the delay bound, and a hierarchical scheme is proposed to solve the resource allocation between multiple base stations and users.

Abstract

This paper proposes a two-stage intelligent scheduler that minimizes packet-level delay jitter while guaranteeing the delay bound. First, Lyapunov optimization is employed to transform the delay-violation constraint into a sequential slot-level queue stability problem. Second, a hierarchical scheme is proposed to solve the resource allocation between multiple base stations and users: multi-agent reinforcement learning (MARL) determines the user priority and the number of scheduled packets, while the underlying scheduler allocates the resources. The proposed scheme achieves lower delay jitter and a lower delay violation rate than the Round-Robin Earliest Deadline First algorithm and MARL with a delay violation penalty.
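The step from a long-term delay-violation constraint to a slot-level queue stability problem can be illustrated with the standard virtual-queue construction from Lyapunov optimization. The sketch below is not the paper's formulation, which the abstract does not give. It assumes a hypothetical per-slot violation count `v(t)` and a hypothetical tolerated average violation rate `epsilon`, and it applies the textbook update Z(t+1) = max(Z(t) + v(t) - epsilon, 0). If the virtual queue Z stays stable, the time-average of v(t) does not exceed epsilon.

```python
def update_virtual_queue(z, violations, epsilon):
    """One slot-level update: Z(t+1) = max(Z(t) + v(t) - epsilon, 0).

    z          -- current virtual queue backlog Z(t) (hypothetical symbol)
    violations -- delay violations observed in this slot, v(t) (assumed input)
    epsilon    -- tolerated average violations per slot (assumed parameter)
    """
    return max(z + violations - epsilon, 0.0)


def simulate(violation_trace, epsilon):
    """Run the virtual queue over a trace of per-slot violation counts."""
    z = 0.0
    history = []
    for v in violation_trace:
        z = update_virtual_queue(z, v, epsilon)
        history.append(z)
    return history


if __name__ == "__main__":
    # The backlog grows in slots with violations and drains by epsilon otherwise.
    print(simulate([1, 0, 0, 1], epsilon=0.5))
```

In a scheme of this kind, the backlog Z(t) would typically enter the learning agents' reward or state so that a growing backlog pushes scheduling decisions toward packets close to their deadline. How the paper couples the queue to MARL is not stated in the abstract.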
