Online Learning for Autonomous Management of Intent-based 6G Networks
Erciyes Karakaya, Ozgur Ercetin, Huseyin Ozkan, Mehmet Karaca, Elham Dehghan Biyar, Alexandros Palaios
TL;DR
The paper tackles conflict-prone, intent-based management in autonomous 6G networks by proposing a hierarchical online learning framework based on multi-armed bandits. Child CLs independently target specific KPI intents, while a parent CL aggregates their local estimates to resolve cross-service conflicts using a Fed2-UCB strategy, enabling adaptive resource allocation under dynamic KPI targets and bandwidth limits. The authors formulate the problem with an objective to maximize the cumulative reward over a horizon while minimizing pseudo-regret $R(T)$ and present Algorithms 1 (Child CL) and 2 (MABCR) to realize the hierarchy. Through a Python-based emulator with URLLC and video services, they show that the MABCR approach outperforms greedy and Thompson sampling in SLA adherence and PRB efficiency, benefiting from the stabilizing influence of the parent evaluator. The work demonstrates a practical pathway to scalable, autonomous management of intent-based networks in 6G, with potential for real-world deployment and further exploration of network configurations.
Abstract
The growing complexity of networks and the variety of future scenarios with diverse and often stringent performance requirements call for a higher level of automation. Intent-based management emerges as a solution to attain high level of automation, enabling human operators to solely communicate with the network through high-level intents. The intents consist of the targets in the form of expectations (i.e., latency expectation) from a service and based on the expectations the required network configurations should be done accordingly. It is almost inevitable that when a network action is taken to fulfill one intent, it can cause negative impacts on the performance of another intent, which results in a conflict. In this paper, we aim to address the conflict issue and autonomous management of intent-based networking, and propose an online learning method based on the hierarchical multi-armed bandits approach for an effective management. Thanks to this hierarchical structure, it performs an efficient exploration and exploitation of network configurations with respect to the dynamic network conditions. We show that our algorithm is an effective approach regarding resource allocation and satisfaction of intent expectations.
