DecompGAIL: Learning Realistic Traffic Behaviors with Decomposed Multi-Agent Generative Adversarial Imitation Learning
Ke Guo, Haochen Liu, Xiaojun Wu, Chen Lv
TL;DR
DecompGAIL addresses the instability of multi-agent Generative Adversarial Imitation Learning in traffic settings by decomposing realism into ego–map (scene) and ego–neighbor (interaction) components, thereby suppressing weakly relevant neighbor–neighbor and neighbor–map signals. It augments this with a distance-weighted social reward within a SMART-based Transformer backbone to encourage global realism. The approach combines BC pretraining with a decomposed discriminator and social PPO fine-tuning, achieving state-of-the-art realism on the Waymo WOMD Sim Agents 2025 benchmark and exhibiting improved training stability over standard PS-GAIL. This has practical impact for safer, more reliable traffic simulation used in autonomous driving evaluation and urban planning.
Abstract
Realistic traffic simulation is critical for the development of autonomous driving systems and urban mobility planning, yet existing imitation learning approaches often fail to model realistic traffic behaviors. Behavior cloning suffers from covariate shift, while Generative Adversarial Imitation Learning (GAIL) is notoriously unstable in multi-agent settings. We identify a key source of this instability: irrelevant interaction misguidance, where a discriminator penalizes an ego vehicle's realistic behavior due to unrealistic interactions among its neighbors. To address this, we propose Decomposed Multi-agent GAIL (DecompGAIL), which explicitly decomposes realism into ego-map and ego-neighbor components, filtering out misleading neighbor: neighbor and neighbor: map interactions. We further introduce a social PPO objective that augments ego rewards with distance-weighted neighborhood rewards, encouraging overall realism across agents. Integrated into a lightweight SMART-based backbone, DecompGAIL achieves state-of-the-art performance on the WOMD Sim Agents 2025 benchmark.
