Communication-Aware Reinforcement Learning for Cooperative Adaptive Cruise Control

Sicong Jiang; Seongjin Choi; Lijun Sun

TL;DR

CA-RL addresses scalability and robustness in multi-agent CACC with a Communication-Aware Reinforcement Learning framework. Forward and backward vehicle-to-vehicle (V2V) information transfer speeds up cyclic information propagation through the platoon while all vehicles keep a shared policy. The communication-aware (CA) module processes high-dimensional communication data with neural networks and integrates with actor-critic algorithms (e.g., CA-DDPG, CA-TD3), enabling centralized training with decentralized execution. Evaluation on the NGSIM highway dataset shows that CA-RL improves headway, jerk, speed, time-to-collision (TTC) safety, and string stability, generalizes across platoon sizes and unseen scenarios, and outperforms the IDM, Krauss, MADDPG, and standard DDPG/TD3 baselines. These results point to CA-RL's potential to improve safety, efficiency, and scalability in real-world CACC deployments; future work includes adapting to different road conditions and broader driving tasks.

Abstract

Cooperative Adaptive Cruise Control (CACC) plays a pivotal role in enhancing traffic efficiency and safety in Connected and Autonomous Vehicles (CAVs). Reinforcement Learning (RL) has proven effective in optimizing complex decision-making processes in CACC, leading to improved system performance and adaptability. Among RL approaches, Multi-Agent Reinforcement Learning (MARL) has shown remarkable potential by enabling coordinated actions among multiple CAVs through Centralized Training with Decentralized Execution (CTDE). However, MARL often faces scalability issues, particularly when CACC vehicles suddenly join or leave the platoon, resulting in performance degradation. To address these challenges, we propose Communication-Aware Reinforcement Learning (CA-RL). CA-RL includes a communication-aware module that extracts and compresses vehicle communication information through forward and backward information transmission modules. This enables efficient cyclic information propagation within the CACC traffic flow, ensuring policy consistency and mitigating the scalability problems of MARL in CACC. Experimental results demonstrate that CA-RL significantly outperforms baseline methods in various traffic scenarios, achieving superior scalability, robustness, and overall system performance while maintaining reliable performance despite changes in the number of participating vehicles.
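
The forward and backward message passing described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the single tanh layers, the dimensions `OBS_DIM` and `MSG_DIM`, the class `SharedCAPolicy`, and the function `platoon_step` are all hypothetical names and choices made for illustration. The sketch assumes each vehicle receives one message from its predecessor and one from its follower per time step, and that every vehicle reuses the same weights, which is what would let the platoon size change without retraining.

```python
import math
import random

OBS_DIM = 3  # hypothetical local observation, e.g. gap, relative speed, own speed
MSG_DIM = 4  # hypothetical size of a compressed V2V message


def _linear_tanh(weights, x):
    """One dense layer with tanh activation; weights is a list of rows."""
    return [math.tanh(sum(w * v for w, v in zip(row, x))) for row in weights]


class SharedCAPolicy:
    """A single set of weights reused by every vehicle (assumed shared policy)."""

    def __init__(self, seed=0):
        rng = random.Random(seed)
        in_dim = OBS_DIM + 2 * MSG_DIM

        def init(rows):
            return [[rng.gauss(0.0, 0.1) for _ in range(in_dim)] for _ in range(rows)]

        self.w_fwd = init(MSG_DIM)  # produces the message sent to the follower
        self.w_bwd = init(MSG_DIM)  # produces the message sent to the predecessor
        self.w_act = init(1)        # produces a normalized acceleration in [-1, 1]

    def step(self, obs, msg_from_front, msg_from_rear):
        # Combine the local observation with both incoming messages.
        x = list(obs) + list(msg_from_front) + list(msg_from_rear)
        accel = _linear_tanh(self.w_act, x)[0]
        return accel, _linear_tanh(self.w_fwd, x), _linear_tanh(self.w_bwd, x)


def platoon_step(policy, observations, fwd_msgs, bwd_msgs):
    """Advance all vehicles one step; index 0 is the platoon leader."""
    n = len(observations)
    zero = [0.0] * MSG_DIM  # placeholder when there is no neighbor on that side
    accels, new_fwd, new_bwd = [], [], []
    for i, obs in enumerate(observations):
        from_front = fwd_msgs[i - 1] if i > 0 else zero
        from_rear = bwd_msgs[i + 1] if i < n - 1 else zero
        accel, fwd_out, bwd_out = policy.step(obs, from_front, from_rear)
        accels.append(accel)
        new_fwd.append(fwd_out)
        new_bwd.append(bwd_out)
    return accels, new_fwd, new_bwd


# The same policy object runs on platoons of different sizes.
policy = SharedCAPolicy(seed=0)
for n in (3, 6):
    obs = [[0.5, 0.0, 0.3] for _ in range(n)]
    fwd = [[0.0] * MSG_DIM for _ in range(n)]
    bwd = [[0.0] * MSG_DIM for _ in range(n)]
    accels, fwd, bwd = platoon_step(policy, obs, fwd, bwd)
    print(n, len(accels))
```

Because each call to `platoon_step` moves messages one hop in each direction, information from any vehicle reaches the whole platoon after repeated steps. In an actor-critic setup such as the CA-DDPG or CA-TD3 variants named in the TL;DR, the random weights here would be replaced by trained networks.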

Paper Structure (25 sections, 17 equations, 5 figures, 3 tables)

Introduction
Methodology
Problem Formulation
States and Observations
Actions
Rewards
Communication-Aware RL with Message Processing
Implementation with Actor-Critic Network
Experimental Setting
Dataset
Simulation Environment
Baseline Models
Evaluation Metrics
Results
Aggregated Measures
...and 10 more sections

Figures (5)

Figure 1: Architecture of the Communication-Aware module. The module receives information from the front and rear vehicles, processes it with a network module, and sends the result to the surrounding vehicles. The current action is computed from this information together with the RL network.
Figure 2: Communication-Aware RL framework combined with Actor-Critic network
Figure 3: Empirical cumulative distributions of speed, headway, and jerk for different models
Figure 4: Position, speed, and acceleration versus time step t in an NGSIM trajectory for different models with different initial spacing settings
Figure 5: Position, speed, and acceleration versus time step t for different models under a stop-and-go scenario
