CoLLMLight: Cooperative Large Language Model Agents for Network-Wide Traffic Signal Control
Zirui Yuan, Siqi Lai, Hao Liu
TL;DR
CoLLMLight addresses the challenge of network-wide traffic signal control by enabling cooperative LLM agents that reason over a structured spatiotemporal graph of intersections. It introduces a complexity-aware reasoning mechanism to adapt inference depth to current congestion and a simulation-driven fine-tuning pipeline that builds task-specific reasoning chains and refines policies using environmental feedback. The framework demonstrates strong generalization across synthetic and seven real-world datasets, outperforming traditional methods, RL-based controllers, and generic LLM baselines, particularly in highly coordinated networks like New York. Overall, CoLLMLight offers a scalable, efficient approach to cross-intersection coordination with concrete mechanisms for spatiotemporal reasoning, adaptive computation, and continual improvement through simulation and feedback.
Abstract
Traffic Signal Control (TSC) plays a critical role in urban traffic management by optimizing traffic flow and mitigating congestion. While Large Language Models (LLMs) have recently emerged as promising tools for TSC due to their exceptional problem-solving and generalization capabilities, existing approaches fail to address the essential need for inter-agent coordination, limiting their effectiveness in achieving network-wide optimization. To bridge this gap, we propose CoLLMLight, a cooperative LLM agent framework for TSC. Specifically, we first construct a structured spatiotemporal graph to capture real-time traffic dynamics and spatial relationships among neighboring intersections, enabling the LLM to reason about complex traffic interactions. Moreover, we introduce a complexity-aware reasoning mechanism that dynamically adapts reasoning depth based on real-time traffic conditions, ensuring optimal computational efficiency without sacrificing decision quality. Besides, we propose a fine-tuning strategy that leverages iterative simulation-driven data collection and environmental feedback to build a lightweight LLM tailored for cooperative TSC. Extensive experiments on both synthetic and real-world datasets demonstrate that CoLLMLight outperforms state-of-the-art methods in diverse traffic scenarios, showcasing its effectiveness, scalability, and robustness.
