Multi-Task Lane-Free Driving Strategy for Connected and Automated Vehicles: A Multi-Agent Deep Reinforcement Learning Approach
Mehran Berahman, Majid Rostami-Shahrbabaki, Klaus Bogenberger
TL;DR
The paper addresses decision-making for lane-free driving of connected and automated vehicles under non-stationary traffic. It introduces a centralized-training, decentralized-execution MADDPG framework that uses dynamic elliptical safety borders and inter-vehicle nudging and repulsion forces to enable overtaking, merging, and safe following in lane-free environments. Trained on a lane-free ring road in SUMO and evaluated on a 4 km freeway, the approach demonstrates substantial capacity gains (up to ~17,000 veh/h) and emergent lateral sorting by desired speed, while prioritizing safety and passenger comfort. Limitations include training in relatively low-density traffic and plans for transfer learning and cooperative multi-agent extensions to further enhance performance in denser networks.
Abstract
Deep reinforcement learning has shown promise in various engineering applications, including vehicular traffic control. The non-stationary nature of traffic, especially in the lane-free environment with more degrees of freedom in vehicle behaviors, poses challenges for decision-making since a wrong action might lead to a catastrophic failure. In this paper, we propose a novel driving strategy for Connected and Automated Vehicles (CAVs) based on a competitive Multi-Agent Deep Deterministic Policy Gradient approach. The developed multi-agent deep reinforcement learning algorithm creates a dynamic and non-stationary scenario, mirroring real-world traffic complexities and making trained agents more robust. The algorithm's reward function is strategically and uniquely formulated to cover multiple vehicle control tasks, including maintaining desired speeds, overtaking, collision avoidance, and merging and diverging maneuvers. Moreover, additional considerations for both lateral and longitudinal passenger comfort and safety criteria are taken into account. We employed inter-vehicle forces, known as nudging and repulsive forces, to manage the maneuvers of CAVs in a lane-free traffic environment. The proposed driving algorithm is trained and evaluated on lane-free roads using the Simulation of Urban Mobility platform. Experimental results demonstrate the algorithm's efficacy in handling different objectives, highlighting its potential to enhance safety and efficiency in autonomous driving within lane-free traffic environments.
