Unicorn: A Universal and Collaborative Reinforcement Learning Approach Towards Generalizable Network-Wide Traffic Signal Control

Yifeng Zhang; Yilin Liu; Ping Gong; Peizhuo Li; Mingfeng Fan; Guillaume Sartoretti

Unicorn: A Universal and Collaborative Reinforcement Learning Approach Towards Generalizable Network-Wide Traffic Signal Control

Yifeng Zhang, Yilin Liu, Ping Gong, Peizhuo Li, Mingfeng Fan, Guillaume Sartoretti

TL;DR

Unicorn tackles generalizable network-wide ATSC by unifying intersection state-action representations around traffic movements and introducing two core modules: Universal Traffic Representation ($UTR$) for general feature extraction and Intersection Specifics Representation ($ISR$) for topology-aware latent modeling via a variational autoencoder. A contrastive learning objective refines latent representations, while collaborative policy optimization leverages neighboring actions with an attention mechanism to enable efficient neighborhood coordination under a PPO framework. Evaluations across eight datasets show Unicorn consistently outperforms baselines, including both independent and shared-parameter MARL methods, in metrics like queue length, average speed, and trip delay, and joint training further attains robust generalization. These findings imply substantial practical impact for deployment in real urban networks, enabling scalable, adaptable, and cooperative traffic signal control across heterogeneous topologies and demands.

Abstract

Adaptive traffic signal control (ATSC) is crucial in reducing congestion, maximizing throughput, and improving mobility in rapidly growing urban areas. Recent advancements in parameter-sharing multi-agent reinforcement learning (MARL) have greatly enhanced the scalable and adaptive optimization of complex, dynamic flows in large-scale homogeneous networks. However, the inherent heterogeneity of real-world traffic networks, with their varied intersection topologies and interaction dynamics, poses substantial challenges to achieving scalable and effective ATSC across different traffic scenarios. To address these challenges, we present Unicorn, a universal and collaborative MARL framework designed for efficient and adaptable network-wide ATSC. Specifically, we first propose a unified approach to map the states and actions of intersections with varying topologies into a common structure based on traffic movements. Next, we design a Universal Traffic Representation (UTR) module with a decoder-only network for general feature extraction, enhancing the model's adaptability to diverse traffic scenarios. Additionally, we incorporate an Intersection Specifics Representation (ISR) module, designed to identify key latent vectors that represent the unique intersection's topology and traffic dynamics through variational inference techniques. To further refine these latent representations, we employ a contrastive learning approach in a self-supervised manner, which enables better differentiation of intersection-specific features. Moreover, we integrate the state-action dependencies of neighboring agents into policy optimization, which effectively captures dynamic agent interactions and facilitates efficient regional collaboration. Our results show that Unicorn outperforms other methods across various evaluation metrics, highlighting its potential in complex, dynamic traffic networks.

Unicorn: A Universal and Collaborative Reinforcement Learning Approach Towards Generalizable Network-Wide Traffic Signal Control

TL;DR

Abstract

Unicorn: A Universal and Collaborative Reinforcement Learning Approach Towards Generalizable Network-Wide Traffic Signal Control

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (5)