Dyna-Style Learning with A Macroscopic Model for Vehicle Platooning in Mixed-Autonomy Traffic
Yichuan Zou, Li Jin, Xi Xiong
TL;DR
The paper tackles fuel-efficient platooning in mixed-autonomy highway traffic by coupling a macroscopic LWR-based PDE with a platoon-tracking ODE and augmenting it with Kalman updates to adapt density and speed parameters. It introduces a Dyna-style planning and learning framework that uses this interpretable PDE-ODE model as a world model to generate virtual experiences for Q-learning, aiming to minimize fuel consumption. Empirical results in SUMO show a roughly 10% reduction in total fuel use and faster convergence than model-free baselines, alongside improved congestion metrics at bottlenecks. The approach offers a transparent, data-efficient alternative to neural-network heavy models and demonstrates practical impact for highway efficiency in mixed autonomy scenarios.
Abstract
Platooning of connected and autonomous vehicles (CAVs) plays a vital role in modernizing highways, ushering in enhanced efficiency and safety. This paper explores the significance of platooning in smart highways, employing a coupled partial differential equation (PDE) and ordinary differential equation (ODE) model to elucidate the complex interaction between bulk traffic flow and CAV platoons. Our study focuses on developing a Dyna-style planning and learning framework tailored for platoon control, with a specific goal of reducing fuel consumption. By harnessing the coupled PDE-ODE model, we improve data efficiency in Dyna-style learning through virtual experiences. Simulation results validate the effectiveness of our macroscopic model in modeling platoons within mixed-autonomy settings, demonstrating a notable $10.11\%$ reduction in vehicular fuel consumption compared to conventional approaches.
