Reinforcement learning-based adaptive speed controllers in mixed autonomy condition
Han Wang, Hossein Nick Zinat Matin, Maria Laura Delle Monache
TL;DR
This work tackles congestion in mixed autonomy traffic by modeling the system with a PDE-ODE framework where density $\rho$ obeys $\rho_t + \partial_x f(\rho)=0$ and AV dynamics are captured by an ODE. An RL-based adaptive speed controller is learned via an Actor-Critic policy that interacts with the PDE-ODE to regulate AV speed in real time. Key findings include a 15% improvement in minimum flux, a 17% increase in average global speed, a 35% reduction in speed variation, and a 16% improvement in PPO-normalized reward, demonstrating effective shockwave mitigation and flow stabilization. The approach offers a practical path toward leveraging AVs for traffic efficiency, with future work addressing scalability, real-data integration, safety, and socio-economic considerations.
Abstract
The integration of Automated Vehicles (AVs) into traffic flow holds the potential to significantly improve traffic congestion by enabling AVs to function as actuators within the flow. This paper introduces an adaptive speed controller tailored for scenarios of mixed autonomy, where AVs interact with human-driven vehicles. We model the traffic dynamics using a system of strongly coupled Partial and Ordinary Differential Equations (PDE-ODE), with the PDE capturing the general flow of human-driven traffic and the ODE characterizing the trajectory of the AVs. A speed policy for AVs is derived using a Reinforcement Learning (RL) algorithm structured within an Actor-Critic (AC) framework. This algorithm interacts with the PDE-ODE model to optimize the AV control policy. Numerical simulations are presented to demonstrate the controller's impact on traffic patterns, showing the potential of AVs to improve traffic flow and reduce congestion.
