Dynamic Configuration of On-Street Parking Spaces using Multi Agent Reinforcement Learning
Oshada Jayasinghe, Farhana Choudhury, Egemen Tanin, Shanika Karunasekera
TL;DR
This work tackles urban traffic caused by on-street parking by proposing a scalable two-layer MARL framework to dynamically configure parking spaces. It introduces lane-level DQN enhanced with LSTM and GAT to capture spatio-temporal traffic patterns, supervised by block-level agents that maintain neighborhood parking supply. Extensive SUMO-based experiments on synthetic and real Melbourne data demonstrate up to 47% reduction in average travel-time loss with minimal walking-distance penalties and notable emission reductions, outperforming static and other RL baselines. The approach shows strong robustness and practical deployment potential through centralized training with decentralized execution and V2I-enabled information dissemination. It advances infrastructure-centric parking management as a proactive tool for congestion mitigation in smart cities.
Abstract
With increased travelling needs more than ever, traffic congestion has become a major concern in most urban areas. Allocating spaces for on-street parking, further hinders traffic flow, by limiting the effective road width available for driving. With the advancement of vehicle-to-infrastructure connectivity technologies, we explore how the impact of on-street parking on traffic congestion could be minimized, by dynamically configuring on-street parking spaces. Towards that end, we formulate dynamic on-street parking space configuration as an optimization problem, and we follow a data driven approach, considering the nature of our problem. Our proposed solution comprises a two-layer multi agent reinforcement learning based framework, which is inherently scalable to large road networks. The lane level agents are responsible for deciding the optimal parking space configuration for each lane, and we introduce a novel Deep Q-learning architecture which effectively utilizes long short term memory networks and graph attention networks to capture the spatio-temporal correlations evident in the given problem. The block level agents control the actions of the lane level agents and maintain a sufficient level of parking around the block. We conduct a set of comprehensive experiments using SUMO, on both synthetic data as well as real-world data from the city of Melbourne. Our experiments show that the proposed framework could reduce the average travel time loss of vehicles significantly, reaching upto 47%, with a negligible increase in the walking distance for parking.
