Optimistic Learning for Communication Networks
George Iosifidis, Naram Mhaisen, Douglas J. Leith
TL;DR
Optimistic Learning (OpL) integrates offline predictive models with online convex optimization (OCO) to accelerate decision-making in dynamic communication networks while preserving online robustness. By incorporating gradient or function predictions into online algorithms (notably OFTRL and related variants), OpL achieves regret that scales with the prediction error: it shrinks to $\mathcal{O}(1)$ when forecasts are accurate and degrades gracefully to standard OCO bounds otherwise. The tutorial develops the theory (definitions, regret bounds, adaptivity), presents multiple OpL algorithms, and demonstrates applications to caching, edge computing, network slicing, and O-RAN workload assignment, including memory-aware and discrete-placement problems. It also outlines future directions, including hybrid optimism, SEA models, non-convex settings, and joint predictor-learner design, to broaden OpL’s applicability in next-generation networks. Overall, OpL provides a principled, universal framework for leveraging predictive information in network control with strong worst-case guarantees and practical performance gains.
Abstract
AI/ML-based tools are at the forefront of resource management solutions for communication networks. Deep learning, in particular, is highly effective in facilitating fast and high-performing decision-making whenever representative training data is available to build accurate models offline. Conversely, online learning solutions do not require training and enable adaptive decisions based on runtime observations, but they are often overly conservative. This extensive tutorial proposes the use of optimistic learning (OpL) as a decision engine for resource management frameworks in modern communication systems. When properly designed, such solutions can achieve fast and high-performing decisions (comparable to those of offline-trained models) while preserving the robustness and performance guarantees of the respective online learning approaches. We introduce the fundamental concepts, algorithms, and results of OpL, discuss the roots of this theory, and present different approaches to defining and achieving optimism. We then showcase how OpL can enhance resource management in communication networks for several key problems such as caching, edge computing, network slicing, and workload assignment in decentralized O-RAN platforms. Finally, we discuss the open challenges that must be addressed to unlock the full potential of this new resource management approach.
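To make the optimistic principle concrete, the following is a minimal sketch, not the tutorial's own algorithm, of an optimistic follow-the-regularized-leader update for linear losses $\langle g_t, x\rangle$ over an $\ell_2$ ball. It assumes a Euclidean regularizer whose weight grows with the accumulated prediction error $\sum_t \|g_t - \tilde{g}_t\|^2$; the function name `oftrl_ball`, its parameters, and the specific regularization schedule are illustrative assumptions.

```python
import numpy as np


def oftrl_ball(grads, preds, radius=1.0, eps=1e-8):
    """Illustrative optimistic FTRL over an L2 ball (hypothetical helper).

    grads: (T, d) array of observed loss gradients g_t.
    preds: (T, d) array of predictions g~_t, available before playing x_t.
    Returns the regret against the best fixed point in the ball.
    """
    d = grads.shape[1]
    cum_grad = np.zeros(d)   # sum of gradients observed so far
    cum_err = 0.0            # sum of squared prediction errors so far
    total_loss = 0.0
    for g, g_hat in zip(grads, preds):
        # Regularizer weight grows only when past predictions were wrong.
        sigma = np.sqrt(eps + cum_err) / radius
        # Minimize <cum_grad + g_hat, x> + (sigma / 2) * ||x||^2 over the ball:
        # the unconstrained minimizer, rescaled onto the ball if needed.
        x = -(cum_grad + g_hat) / sigma
        norm = np.linalg.norm(x)
        if norm > radius:
            x *= radius / norm
        total_loss += float(g @ x)
        cum_grad += g
        cum_err += float(np.sum((g - g_hat) ** 2))
    # For linear losses, the best fixed point is -radius * G / ||G||.
    best_fixed_loss = -radius * np.linalg.norm(cum_grad)
    return total_loss - best_fixed_loss


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    g = rng.normal(size=(500, 5))
    print("regret, perfect predictions:", oftrl_ball(g, g.copy()))
    print("regret, no predictions:     ", oftrl_ball(g, np.zeros_like(g)))
```

With perfect predictions the accumulated error stays at zero, the regularizer stays negligible, and the learner effectively plays the best response to the losses including the upcoming one. With all-zero predictions the update reduces to a standard adaptive FTRL step, which is the fallback behavior the regret guarantees describe.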
