Multi-Agent Reinforcement Learning in Intelligent Transportation Systems: A Comprehensive Survey

Rexcharles Donatus; Kumater Ter; Daniel Udekwe

Multi-Agent Reinforcement Learning in Intelligent Transportation Systems: A Comprehensive Survey

Rexcharles Donatus, Kumater Ter, Daniel Udekwe

TL;DR

A structured taxonomy is introduced that categorizes MARL approaches according to coordination models and learning algorithms, spanning value based, policy based, actor critic, and communication enhanced frameworks, as well as identifying core challenges, including scalability, non stationarity, credit assignment, communication constraints, and the sim to real transfer gap.

Abstract

The growing complexity of urban mobility and the demand for efficient, sustainable, and adaptive solutions have positioned Intelligent Transportation Systems (ITS) at the forefront of modern infrastructure innovation. At the core of ITS lies the challenge of autonomous decision-making across dynamic, large scale, and uncertain environments where multiple agents traffic signals, autonomous vehicles, or fleet units must coordinate effectively. Multi Agent Reinforcement Learning (MARL) offers a promising paradigm for addressing these challenges by enabling distributed agents to jointly learn optimal strategies that balance individual objectives with system wide efficiency. This paper presents a comprehensive survey of MARL applications in ITS. We introduce a structured taxonomy that categorizes MARL approaches according to coordination models and learning algorithms, spanning value based, policy based, actor critic, and communication enhanced frameworks. Applications are reviewed across key ITS domains, including traffic signal control, connected and autonomous vehicle coordination, logistics optimization, and mobility on demand systems. Furthermore, we highlight widely used simulation platforms such as SUMO, CARLA, and CityFlow that support MARL experimentation, along with emerging benchmarks. The survey also identifies core challenges, including scalability, non stationarity, credit assignment, communication constraints, and the sim to real transfer gap, which continue to hinder real world deployment.

Multi-Agent Reinforcement Learning in Intelligent Transportation Systems: A Comprehensive Survey

TL;DR

Abstract

Multi-Agent Reinforcement Learning in Intelligent Transportation Systems: A Comprehensive Survey

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (12)