Multi-agent Reinforcement Learning-based Joint Precoding and Phase Shift Optimization for RIS-aided Cell-Free Massive MIMO Systems

Yiyang Zhu; Enyu Shi; Ziheng Liu; Jiayi Zhang; Bo Ai

Multi-agent Reinforcement Learning-based Joint Precoding and Phase Shift Optimization for RIS-aided Cell-Free Massive MIMO Systems

Yiyang Zhu, Enyu Shi, Ziheng Liu, Jiayi Zhang, Bo Ai

TL;DR

This work tackles the challenge of maximizing sum spectral efficiency in RIS-aided cell-free mMIMO by jointly optimizing AP precoding and RIS phase shifts. It introduces a centralized-training, distributed-execution MARL framework enhanced with fuzzy logic (FL-MARL), where each AP acts as an agent and phase shifts are learned via a two-layer network using local CSI, reducing backhaul burden. The approach reformulates the non-convex problem into a MARL-CTDE setting and demonstrates that FL-MARL achieves SE performance close to MADDPG while significantly reducing computational complexity and speeding up convergence (about 30% faster in reported tests) compared to conventional methods. Empirical results show that the proposed method scales well with more antennas, RISs, and RIS elements, outperforming alternating optimization and offering practical benefits for large, backhaul-constrained deployments. The findings underscore FL-MARL as a viable, scalable solution for real-world RIS-aided CF mMIMO systems, with future work on STAR-RIS and imperfect CSI scenarios.

Abstract

Cell-free (CF) massive multiple-input multiple-output (mMIMO) is a promising technique for achieving high spectral efficiency (SE) using multiple distributed access points (APs). However, harsh propagation environments often lead to significant communication performance degradation due to high penetration loss. To overcome this issue, we introduce the reconfigurable intelligent surface (RIS) into the CF mMIMO system as a low-cost and power-efficient solution. In this paper, we focus on optimizing the joint precoding design of the RIS-aided CF mMIMO system to maximize the sum SE. This involves optimizing the precoding matrix at the APs and the reflection coefficients at the RIS. To tackle this problem, we propose a fully distributed multi-agent reinforcement learning (MARL) algorithm that incorporates fuzzy logic (FL). Unlike conventional approaches that rely on alternating optimization techniques, our FL-based MARL algorithm only requires local channel state information, which reduces the need for high backhaul capacity. Simulation results demonstrate that our proposed FL-MARL algorithm effectively reduces computational complexity while achieving similar performance as conventional MARL methods.

Multi-agent Reinforcement Learning-based Joint Precoding and Phase Shift Optimization for RIS-aided Cell-Free Massive MIMO Systems

TL;DR

Abstract

Paper Structure (23 sections, 12 equations, 5 figures, 2 tables, 1 algorithm)

This paper contains 23 sections, 12 equations, 5 figures, 2 tables, 1 algorithm.

Introduction
System Model
Channel Model
Transmitters and Receivers
Problem Formulation
Proposed Joint Precoding and Phase Shift Optimization Framework
Overview of the Framework
Fuzzy Logic
Agent
State space
Initialization
Fuzzy Action space
Reward space
FL-MARL Algorithm
Complexity comparison
...and 8 more sections

Figures (5)

Figure 1: The RIS-aided CF mMIMO system and the proposed MARL precoding network.
Figure 2: Average reward against the training step with $step$ = 100, $L$ = 4, $K$ = 4, $R$ = 4, $M$ = 8, $U$ = 1, and $N$ = 16.
Figure 3: Sum-SE against the number of AP antennas with $L$ = 4, $K$ = 4, $R$ = 4, $U$ = 1, and $N$ = 16.
Figure 4: Sum-SE against the number of RISs with $L$ = 4, $K$ = 4, $M$ = 8, $U$ = 1 and $N$ = 16.
Figure 5: Sum-SE against the number of RIS elements with $L$ = 4, $K$ = 4, $R$ = 4, $M$ = 8, and $U$ = 1.

Multi-agent Reinforcement Learning-based Joint Precoding and Phase Shift Optimization for RIS-aided Cell-Free Massive MIMO Systems

TL;DR

Abstract

Multi-agent Reinforcement Learning-based Joint Precoding and Phase Shift Optimization for RIS-aided Cell-Free Massive MIMO Systems

Authors

TL;DR

Abstract

Table of Contents

Figures (5)