Adaptive Resource Allocation for Virtualized Base Stations in O-RAN with Online Learning
Michail Kalntis, George Iosifidis, Fernando A. Kuipers
TL;DR
This paper tackles robust resource thresholding for virtualized base stations in O-RAN under non-stationary conditions by casting non-RT policy configuration as an online learning problem. It introduces BSvBS, an adversarial bandit algorithm with sub-linear regret, and MetBS, a meta-learning framework that dynamically selects the best-performing algorithm for a given environment. The authors provide formal regret guarantees, analyze computational overhead, and validate the approach on real-world traces, achieving up to 64.5% energy savings and sub-linear regret across static, stationary, and adversarial scenarios. The work enables practical, low-overhead, robust vBS management in multi-tier O-RAN architectures and suggests a roadmap for integrating such learning-driven policies into live RIC ecosystems.
Abstract
Open Radio Access Network systems, with their virtualized base stations (vBSs), offer operators the benefits of increased flexibility, reduced costs, vendor diversity, and interoperability. Optimizing the allocation of resources in a vBS is challenging since it requires knowledge of the environment, (i.e., "external'' information), such as traffic demands and channel quality, which is difficult to acquire precisely over short intervals of a few seconds. To tackle this problem, we propose an online learning algorithm that balances the effective throughput and vBS energy consumption, even under unforeseeable and "challenging'' environments; for instance, non-stationary or adversarial traffic demands. We also develop a meta-learning scheme, which leverages the power of other algorithmic approaches, tailored for more "easy'' environments, and dynamically chooses the best performing one, thus enhancing the overall system's versatility and effectiveness. We prove the proposed solutions achieve sub-linear regret, providing zero average optimality gap even in challenging environments. The performance of the algorithms is evaluated with real-world data and various trace-driven evaluations, indicating savings of up to 64.5% in the power consumption of a vBS compared with state-of-the-art benchmarks.
