Real-World Deployment and Assessment of a Multi-Agent Reinforcement Learning-Based Variable Speed Limit Control System
Yuhang Zhang, Zhiyao Zhang, Junyi Ji, Marcos Quiñones-Grueiro, William Barbour, Derek Gloudemans, Gergely Zachár, Clay Weston, Gautam Biswas, Daniel B. Work
TL;DR
The paper tackles the challenge of validating a MARL-based variable speed limit system in a real-world highway setting. It presents a full deployment pipeline that trains policies in simulation, augments them with invalid-action masking and safety guards, and operates them through an AI-DSS integrated with the TMC for live field control on I-24 with 67 gantries. Key findings include high autonomous control feasibility (up to 98% of decisions executed without guards), improved proactive warning accuracy and responsiveness (14% higher accuracy and 75% faster response than a benchmark), and promising safety benefits (crash reductions of up to 26% for crashes and 50% for secondary crashes) over the deployed corridor. The work demonstrates the practical viability of scalable MARL for real-world traffic control, delivers open-source code and datasets, and provides a foundation for future enhancements in driver compliance and policy distillation for VSL systems.
Abstract
This article presents the first field deployment of a multi-agent reinforcement learning (MARL) based variable speed limit (VSL) control system on Interstate 24 (I-24) near Nashville, Tennessee. We design and demonstrate a full pipeline from training MARL agents in a traffic simulator to a field deployment on a 17-mile segment of I-24 encompassing 67 VSL controllers. The system was launched on March 8th, 2024, and has made approximately 35 million decisions on 28 million trips in six months of operation. We apply an invalid action masking mechanism and several safety guards to ensure real-world constraints. The MARL-based implementation operates up to 98% of the time, with the safety guards overriding the MARL decisions for the remaining time. We evaluate the performance of the MARL-based algorithm in comparison to a previously deployed non-RL VSL benchmark algorithm on I-24. Results show that the MARL-based VSL control system achieves a superior performance. The accuracy of correctly warning drivers about slowing traffic ahead is improved by 14% and the response delay to non-recurrent congestion is reduced by 75%. The preliminary data shows that the VSL control system has reduced the crash rate by 26% and the secondary crash rate by 50%. We open-sourced the deployed MARL-based VSL algorithm at https://github.com/Lab-Work/marl-vsl-controller.
