DynamicLight: Two-Stage Dynamic Traffic Signal Timing
Liang Zhang, Yutong Zhang, Shubin Xie, Jianming Deng, Chen Li
TL;DR
DynamicLight addresses the rigidity of single-stage reinforcement learning for traffic signal control by introducing a two-stage framework that separately optimizes phase and duration. It employs a unified Deep Q-Network with lane-level feature fusion via multi-head attention to select a phase and its duration, enabling dynamic phase lengths across intersections. Across real-world (JN/HZ/NY) and synthetic topologies in CityFlow, DynamicLight achieves state-of-the-art ATT reductions versus Advanced-CoLight and other baselines, and its variants demonstrate robust scalability and transferability. The work highlights practical potential for real-world deployment, while noting computational demands and lack of inter-intersection coordination as future directions.
Abstract
Reinforcement learning (RL) is gaining popularity as an effective approach for traffic signal control (TSC) and is increasingly applied in this domain. However, most existing RL methodologies are confined to a single-stage TSC framework, primarily focusing on selecting an appropriate traffic signal phase at fixed action intervals, leading to inflexible and less adaptable phase durations. To address such limitations, we introduce a novel two-stage TSC framework named DynamicLight. This framework initiates with a phase control strategy responsible for determining the optimal traffic phase, followed by a duration control strategy tasked with determining the corresponding phase duration. Experimental results show that DynamicLight outperforms state-of-the-art TSC models and exhibits exceptional model generalization capabilities. Additionally, the robustness and potential for real-world implementation of DynamicLight are further demonstrated and validated through various DynamicLight variants. The code is released at https://github.com/LiangZhang1996/DynamicLight.
