RouteRL: Multi-agent reinforcement learning framework for urban route choice with autonomous vehicles
Ahmet Onur Akman, Anastasia Psarou, Łukasz Gorczyca, Zoltán György Varga, Grzegorz Jamróz, Rafał Kucharski
TL;DR
RouteRL addresses the challenge of understanding autonomous vehicle routing in mixed urban traffic by marrying multi-agent reinforcement learning with a high-fidelity microscopic simulator. It introduces an open-source, modular framework that simulates day-to-day route choices for humans and MARL-trained AVs across OpenStreetMap networks, using behavioral models and various MARL algorithms. The work demonstrates experiments on Cologne, Ingolstadt, and Manhattan networks, showing how AV adoption, reward structures, and algorithm choice influence travel times and system performance, while emphasizing reproducibility and policy relevance. By providing a unified testbed with configurable networks, demand, and AV strategies, RouteRL enables rigorous comparisons of MARL approaches and supports research on equity, emissions, and urban mobility in the presence of AV fleets.
Abstract
RouteRL is a novel framework that integrates multi-agent reinforcement learning (MARL) with a microscopic traffic simulation, facilitating the testing and development of efficient route choice strategies for autonomous vehicles (AVs). The proposed framework simulates the daily route choices of driver agents in a city, including two types: human drivers, emulated using behavioral route choice models, and AVs, modeled as MARL agents optimizing their policies for a predefined objective. RouteRL aims to advance research in MARL, transport modeling, and human-AI interaction for transportation applications. This study presents a technical report on RouteRL, outlines its potential research contributions, and showcases its impact via illustrative examples.
