GPUDrive: Data-driven, multi-agent driving simulation at 1 million FPS
Saman Kazemkhani, Aarav Pandya, Daphne Cornelisse, Brennan Shacklett, Eugene Vinitsky
TL;DR
GPUDrive tackles the bottleneck of learning-based multi-agent planning by delivering a GPU-accelerated, data-driven driving simulator capable of over a million steps per second. By leveraging the Madrona engine, BVH-based collision culling, and polyline decimation, it scales to hundreds of worlds with hundreds of agents while providing LiDAR and human-view sensor modalities and Python access for RL/IL workflows. The authors demonstrate training RL agents on the Waymo Open Motion Dataset, achieving high goal-reaching rates in minutes and scaling to thousands of scenarios in hours, with substantial end-to-end speedups over CPU-based or less scalable simulators. They also open-source the simulator, baselines, and training loops to foster reproducibility and broader research in data-driven, multi-agent autonomous driving. This work enables rapid experimentation and evaluation of autonomous driving planners under diverse, complex multi-agent interactions.
Abstract
Multi-agent learning algorithms have been successful at generating superhuman planning in various games but have had limited impact on the design of deployed multi-agent planners. A key bottleneck in applying these techniques to multi-agent planning is that they require billions of steps of experience. To enable the study of multi-agent planning at scale, we present GPUDrive. GPUDrive is a GPU-accelerated, multi-agent simulator built on top of the Madrona Game Engine capable of generating over a million simulation steps per second. Observation, reward, and dynamics functions are written directly in C++, allowing users to define complex, heterogeneous agent behaviors that are lowered to high-performance CUDA. Despite these low-level optimizations, GPUDrive is fully accessible through Python, offering a seamless and efficient workflow for multi-agent, closed-loop simulation. Using GPUDrive, we train reinforcement learning agents on the Waymo Open Motion Dataset, achieving efficient goal-reaching in minutes and scaling to thousands of scenarios in hours. We open-source the code and pre-trained agents at https://github.com/Emerge-Lab/gpudrive.
