Towards Adaptive IMFs -- Generalization of utility functions in Multi-Agent Frameworks
Kaushik Dey, Satheesh K. Perepu, Abir Das, Pallab Dasgupta
TL;DR
The paper tackles the need for Adaptive Intent Management Functions (IMFs) that can handle changing customer intents and priorities without retraining. It introduces a supervisor policy $\pi_g$ that coordinates multiple MARL subsystems and augments it with a DUN-based feature engineering module to encode the utility context, enabling runtime adaptation to different utility function forms such as linear, logarithmic, and quadratic. The approach preserves a fixed training regime while allowing execution-time changes in the function form and priorities, demonstrated via a GlobeCom network emulator across diverse scenarios, including extended action spaces like Move-UE-context and Auto-Scale. Results indicate improved generalization (lower Integral Absolute Error) and faster convergence compared with baselines, suggesting significant practical impact for live, resource-constrained 6G networks.
Abstract
Intent Management Function (IMF) is an integral part of future-generation networks. In recent years, there has been some work on AI-based IMFs that can handle conflicting intents and prioritize the global objective based on apriori definition of the utility function and accorded priorities for competing intents. Some of the earlier works use Multi-Agent Reinforcement Learning (MARL) techniques with AdHoc Teaming (AHT) approaches for efficient conflict handling in IMF. However, the success of such frameworks in real-life scenarios requires them to be flexible to business situations. The intent priorities can change and the utility function, which measures the extent of intent fulfilment, may also vary in definition. This paper proposes a novel mechanism whereby the IMF can generalize to different forms of utility functions and change of intent priorities at run-time without additional training. Such generalization ability, without additional training requirements, would help to deploy IMF in live networks where customer intents and priorities change frequently. Results on the network emulator demonstrate the efficacy of the approach, scalability for new intents, outperforming existing techniques that require additional training to achieve the same degree of flexibility thereby saving cost, and increasing efficiency and adaptability.
