Decentralized Interference-Aware Codebook Learning in Millimeter Wave MIMO Systems

Yu Zhang; Ahmed Alkhateeb

Decentralized Interference-Aware Codebook Learning in Millimeter Wave MIMO Systems

Yu Zhang, Ahmed Alkhateeb

TL;DR

This work addresses interference-aware codebook learning for mmWave MIMO in multi-cell networks where base stations operate asynchronously and cannot exchange information. It introduces a fully decentralized multi-agent reinforcement learning framework that uses a power-measurement averaging estimator to assess interference suppression and a decoupled reward to stabilize learning across nodes. The authors provide theoretical justification showing the averaging-based estimator is a sufficient statistic asymptotically in large antenna regimes, and they validate the approach with simulations demonstrating well-shaped learned codebooks that create deep nulls toward interference without inter-BS communication. The proposed method enables scalable, decentralized beam codebook design for dense mmWave networks, reducing coordination overhead while achieving substantial interference suppression and improved SIR distributions.

Abstract

Beam codebooks are integral components of the future millimeter wave (mmWave) multiple input multiple output (MIMO) system to relax the reliance on the instantaneous channel state information (CSI). The design of these codebooks, therefore, becomes one of the fundamental problems for these systems, and the well-designed codebooks play key roles in enabling efficient and reliable communications. Prior work has primarily focused on the codebook learning problem within a single cell/network and under stationary interference. In this work, we generalize the interference-aware codebook learning problem to networks with multiple cells/basestations. One of the key differences compared to the single-cell codebook learning problem is that the underlying environment becomes non-stationary, as the behavior of one base station will influence the learning of the others. Moreover, to encompass some of the challenging scenarios, information exchange between the different learning nodes is not allowed, which leads to a fully decentralized system with significantly increased learning difficulties. To tackle the non-stationarity, the averaging of the measurements is used to estimate the interference nulling performance of a particular beam, based on which a decision rule is provided. Furthermore, we theoretically justify the adoption of such estimator and prove that it is a sufficient statistic for the underlying quantity of interest in an asymptotic sense. Finally, a novel reward function based on averaging is proposed to fully decouple the learning of the multiple agents running at different nodes. Simulation results show that the developed solution is capable of learning well-shaped codebook patterns for different networks that significantly suppress the interference without information exchange, highlighting ...

Decentralized Interference-Aware Codebook Learning in Millimeter Wave MIMO Systems

TL;DR

Abstract

Paper Structure (20 sections, 3 theorems, 49 equations, 5 figures, 1 table)

This paper contains 20 sections, 3 theorems, 49 equations, 5 figures, 1 table.

Introduction
System and Channel Models
System Model
Channel Model
Problem Formulation
Decentralized Reinforcement Learning Solution
Beam Learning Under Non-Stationary Interference
Estimating the Interference Suppression Performance
Determining the Reward
Practical Operations
Simulation Results
Simulation Setup
Evaluation Method
Numerical Results
Hypothesis testing accuracy
...and 5 more sections

Key Result

Proposition 1

Assume that $\mathrm{Var}[\mathbf{z}]=\boldsymbol{\Sigma}\succ 0$, and there are $\tilde{\boldsymbol{\xi}}$ and $\tilde{\boldsymbol{\xi}^\prime}$, such that where $\lambda_L(\cdot)$ denotes the $L$-th largest eigenvalue of a positive definite matrix, and $\boldsymbol{\Pi}\in\mathbb{C}^{L\times L}$ is defined as Then $\mathbb{E}\left[\mathbf{z}^H\mathbf{A}\mathbf{z}\right]<\mathbb{E}\left[\mathbf

Figures (5)

Figure 1: The considered scenario where there are multiple mmWave base stations operating at the same time and frequency to serve the surrounding users. Besides, there is no coordination among those base stations (such as user scheduling, power control, etc.) and no information sharing, which leads to unavoidable in-band interference that limits the system performance.
Figure 2: The ROC curves of the proposed decision rule with different number of measurements, where (a) shows the case when the other BS is using random transmit beams and (b) shows the case when the other BS is using a fixed DFT codebook.
Figure 3: The simulation results of the proposed beam codebook learning solution, where (a) depicts the considered outdoor communication scenario, (b) shows the achieved SIR map (in decibel) when both BSs using the beamsteering codebooks, and (c) shows the achieved SIR map (in decibel) when both of them using the learned codebooks.
Figure 4: The selected beam patterns from the learned 16-beam codebook of BS 3, where the dashed blue line indicating the direction of the main lobe of the beam and the dashed red line indicating the direction of the interference.
Figure 5: The selected beam patterns from the learned 16-beam codebook of BS 4, where the dashed blue line indicating the direction of the main lobe of the beam and the dashed red line indicating the direction of the interference.

Theorems & Definitions (6)

Proposition 1
proof
Proposition 2
proof
Corollary 1
proof

Decentralized Interference-Aware Codebook Learning in Millimeter Wave MIMO Systems

TL;DR

Abstract

Decentralized Interference-Aware Codebook Learning in Millimeter Wave MIMO Systems

Authors

TL;DR

Abstract

Table of Contents

Key Result

Figures (5)

Theorems & Definitions (6)