Reinforcement Learning Based Oscillation Dampening: Scaling up Single-Agent RL algorithms to a 100 AV highway field operational test
Kathy Jang, Nathan Lichtlé, Eugene Vinitsky, Adit Shah, Matthew Bunting, Matthew Nice, Benedetto Piccoli, Benjamin Seibold, Daniel B. Work, Maria Laura Delle Monache, Jonathan Sprinkle, Jonathan W. Lee, Alexandre M. Bayen
TL;DR
This work investigates reinforcement learning controllers for traffic flow smoothing in the MegaVanderTest, the largest field deployment of automated vehicles to date, encompassing about 100 AVs. It develops two RL controller families (acceleration-based and ACC-based) trained in a data-driven, single-lane simulator that mirrors I-24 traffic dynamics and IDM-based human drivers, and then migrates them to real vehicles via ROS/ONNX interfaces. The paper presents extensive simulation results and a full field test, demonstrating notable improvements in fuel economy and throughput as well as effective dampening of stop-and-go waves, with the deployed system achieving meaningful flow smoothing in congested conditions. Contributions include a scalable cloud-enabled FLOW framework, detailed problem formulations for both controller types, a data-driven training pipeline from real highway trajectories, and a rigorous hardware-validation workflow that bridges simulation to real-world autonomous vehicle control. The findings underscore the practical viability of RL for mixed-autonomy traffic management and highlight design choices that influence safety, generalization, and real-world deployability.
Abstract
In this article, we explore the technical details of the reinforcement learning (RL) algorithms that were deployed in the largest field test of automated vehicles designed to smooth traffic flow in history as of 2023, uncovering the challenges and breakthroughs that come with developing RL controllers for automated vehicles. We delve into the fundamental concepts behind RL algorithms and their application in the context of self-driving cars, discussing the developmental process from simulation to deployment in detail, from designing simulators to reward function shaping. We present the results in both simulation and deployment, discussing the flow-smoothing benefits of the RL controller. From understanding the basics of Markov decision processes to exploring advanced techniques such as deep RL, our article offers a comprehensive overview and deep dive of the theoretical foundations and practical implementations driving this rapidly evolving field. We also showcase real-world case studies and alternative research projects that highlight the impact of RL controllers in revolutionizing autonomous driving. From tackling complex urban environments to dealing with unpredictable traffic scenarios, these intelligent controllers are pushing the boundaries of what automated vehicles can achieve. Furthermore, we examine the safety considerations and hardware-focused technical details surrounding deployment of RL controllers into automated vehicles. As these algorithms learn and evolve through interactions with the environment, ensuring their behavior aligns with safety standards becomes crucial. We explore the methodologies and frameworks being developed to address these challenges, emphasizing the importance of building reliable control systems for automated vehicles.
