Adaptive Design of mmWave Initial Access Codebooks using Reinforcement Learning

Sabrine Aroua; Christos Anastasios Bovolis; Bo Göransson; Anastasios Giovanidis; Mathieu Leconte; Apostolos Destounis

Adaptive Design of mmWave Initial Access Codebooks using Reinforcement Learning

Sabrine Aroua, Christos Anastasios Bovolis, Bo Göransson, Anastasios Giovanidis, Mathieu Leconte, Apostolos Destounis

TL;DR

This work addresses adaptive initial access in mmWave 5G by learning to design SSB codebooks from a pool of expert beams. Framing the problem as a POMDP, it employs an actor–critic RL policy that sequentially selects $n$ beams from a pool of $m$ beams to maximize UE association during cell search, while leveraging expert designs to ensure safety and compatibility. Empirical results show the Neural Codebook consistently outperforms static expert codebooks and several baselines, achieving up to $>10\%$ gains in user connectivity and higher rediscovery rates across diverse deployments. The approach offers a practical, data-driven pathway to more flexible, resilient beam management in next-generation wireless networks, with potential extensions to direct SNR optimization and multi-metric objectives.

Abstract

Initial access (IA) is the process by which user equipment (UE) establishes its first connection with a base station. In 5G systems, particularly at millimeter-wave frequencies, IA integrates beam management to support highly directional transmissions. The base station employs a codebook of beams for the transmission of Synchronization Signal Blocks (SSBs), which are periodically swept to detect and connect users. The design of this SSB codebook is critical for ensuring reliable, wide-area coverage. In current networks, SSB codebooks are meticulously engineered by domain experts. While these expert-defined codebooks provide a robust baseline, they lack flexibility in dynamic or heterogeneous environments where user distributions vary, limiting their overall effectiveness. This paper proposes a hybrid Reinforcement Learning (RL) framework for adaptive SSB codebook design. Building on top of expert knowledge, the RL agent leverages a pool of expert-designed SSB beams and learns to adaptively select or combine them based on real-time feedback. This enables the agent to dynamically tailor codebooks to the actual environment, without requiring explicit user location information, while always respecting practical beam constraints. Simulation results demonstrate that, on average, the proposed approach improves user connectivity by 10.8$\%$ compared to static expert configurations. These findings highlight the potential of combining expert knowledge with data-driven optimization to achieve more intelligent, flexible, and resilient beam management in next-generation wireless networks.

Adaptive Design of mmWave Initial Access Codebooks using Reinforcement Learning

TL;DR

Abstract

Adaptive Design of mmWave Initial Access Codebooks using Reinforcement Learning

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (5)