Intent-based Radio Scheduler for RAN Slicing: Learning to deal with different network scenarios
Cleverson Nahum, Salvatore D'Oro, Pedro Batista, Cristiano Both, Kleber Cardoso, Aldebaro Klautau, Tommaso Melodia
TL;DR
This work tackles the challenge of robust radio resource scheduling in RAN slicing under diverse network scenarios by introducing an intent-based RRS powered by multi-agent reinforcement learning. The architecture splits inter-slice scheduling (PPO-based) from intra-slice scheduling (shared-parameter MARL), and aligns decisions with slice intents through an enhanced intent-drift reward. It demonstrates strong protection for high-priority slices and improved overall intent satisfaction across multiple network scenarios, while showing that generalization across unseen scenarios remains difficult and can be aided by transfer learning, which substantially reduces training time. The results suggest that an intent-aware MARL framework, coupled with transfer learning, is a viable path toward production-ready, adaptable RAN slicing in 6G-era networks.
Abstract
The future mobile network has the complex mission of distributing available radio resources among various applications with different requirements. The radio access network slicing enables the creation of different logical networks by isolating and using dedicated resources for each group of applications. In this scenario, the radio resource scheduling (RRS) is responsible for distributing the radio resources available among the slices to fulfill their service-level agreement (SLA) requirements, prioritizing critical slices while minimizing the number of intent violations. Moreover, ensuring that the RRS can deal with a high diversity of network scenarios is essential. Several recent papers present advances in machine learning-based RRS. However, the scenarios and slice variety are restricted, which inhibits solid conclusions about the generalization capabilities of the models after deployment in real networks. This paper proposes an intent-based RRS using multi-agent reinforcement learning in a radio access network (RAN) slicing context. The proposed method protects high-priority slices when the available radio resources cannot fulfill all the slices. It uses transfer learning to reduce the number of training steps required. The proposed method and baselines are evaluated in different network scenarios that comprehend combinations of different slice types, channel trajectories, number of active slices and users' equipment (UEs), and UE characteristics. The proposed method outperformed the baselines in protecting slices with higher priority, obtaining an improvement of 40% and, when considering all the slices, obtaining an improvement of 20% in relation to the baselines. The results show that by using transfer learning, the required number of training steps could be reduced by a factor of eight without hurting performance.
