Cooperative Cruising: Reinforcement Learning-Based Time-Headway Control for Increased Traffic Efficiency
Yaron Veksler, Sharon Hornstein, Han Wang, Maria Laura Delle Monache, Daniel Urieli
TL;DR
This paper tackles highway congestion in realistic multi-lane settings by proposing a centralized reinforcement-learning-based controller that emits time-headway commands to CAVs near bottlenecks, integrated with safety-certified ACC and low-bandwidth V2I. The method is trained in large-scale SUMO simulations of a 2 km I-24 highway segment, using an MDP formulation with states over 21 road segments, actions as headway vectors, and a time-delay reward tied to free-flow performance. Empirical results show that the RL controller yields up to 13% average speed improvements in single-lane scenarios and up to 7% in multi-lane scenarios, outperforming a fixed-headway baseline across varying CAV penetration. The work emphasizes practicality and safety, offering deployable gains by targeting localized bottlenecks, and provides a novel average-speed metric for open-road simulations to better reflect real-world traffic dynamics.
Abstract
The proliferation of connected automated vehicles represents an unprecedented opportunity for improving driving efficiency and alleviating traffic congestion. However, existing research fails to address realistic multi-lane highway scenarios without assuming connectivity, perception, and control capabilities that are typically unavailable in current vehicles. This paper proposes a novel AI system that is the first to improve highway traffic efficiency compared with human-like traffic in realistic, simulated multi-lane scenarios, while relying on existing connectivity, perception, and control capabilities. At the core of our approach is a reinforcement learning based controller that dynamically communicates time-headways to automated vehicles near bottlenecks based on real-time traffic conditions. These desired time-headways are then used by adaptive cruise control (ACC) systems to adjust their following distance. By (i) integrating existing traffic estimation technology and low-bandwidth vehicle-to-infrastructure connectivity, (ii) leveraging safety-certified ACC systems, and (iii) targeting localized bottleneck challenges that can be addressed independently in different locations, we propose a potentially practical, safe, and scalable system that can positively impact numerous road users.
