SLEDGE: Synthesizing Driving Environments with Generative Models and Rule-Based Traffic
Kashyap Chitta, Daniel Dauner, Andreas Geiger
TL;DR
The paper addresses the problem of generating realistic, long-horizon driving scenes for planning evaluation, replacing log replay with a diffusion-based generative framework. It introduces a Rasterized Latent Map (RLM), learned with a Raster-to-Vector Autoencoder (RVAE), and uses a Diffusion Transformer (DiT) to jointly generate lane graphs and agent states, with city-conditioned generation and inpainting for route extrapolation. Key contributions are a formalization of abstract scene generation with a new benchmark and metrics, evidence that RLM-based representations with channel masking approach ground-truth vector quality, and SLEDGE, a storage-efficient simulator that supports long-horizon, density-controlled planning tests on routes of up to 500 m. The work enables scalable, controllable, data-driven testing of motion planning algorithms and could make rigorous planner evaluation more widely accessible.
Abstract
SLEDGE is the first generative simulator for vehicle motion planning trained on real-world driving logs. Its core component is a learned model that generates agent bounding boxes and lane graphs. The model's outputs serve as the initial state for rule-based traffic simulation. The unique properties of the entities to be generated for SLEDGE, such as their connectivity and variable count per scene, make the naive application of most modern generative models to this task non-trivial. Therefore, together with a systematic study of existing lane graph representations, we introduce a novel raster-to-vector autoencoder. It encodes agents and the lane graph into distinct channels of a rasterized latent map. This facilitates both lane-conditioned agent generation and combined generation of lanes and agents with a Diffusion Transformer. Using generated entities in SLEDGE enables greater control over the simulation, e.g., upsampling turns or increasing traffic density. Further, SLEDGE can support 500 m long routes, a capability not found in existing data-driven simulators like nuPlan. It presents new challenges for planning algorithms, evidenced by failure rates of over 40% for PDM, the winner of the 2023 nuPlan challenge, when tested on hard routes and dense traffic generated by our model. Compared to nuPlan, SLEDGE requires 500$\times$ less storage to set up (<4 GB), making it a more accessible option and helping to democratize future research in this field.
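To make the idea of "distinct channels" concrete, here is a minimal sketch in Python with NumPy. It is not the paper's autoencoder: the rasterized latent map in SLEDGE is learned, whereas this example only draws binary occupancy into a two-channel grid. The function names (rasterize_scene, mask_agents), the 64-pixel grid, the 32 m extent, and the axis-aligned agent boxes are all assumptions chosen for illustration.

```python
import numpy as np


def rasterize_scene(lanes, agents, size=64, extent=32.0):
    """Draw lanes and agents into separate channels of one grid.

    lanes:  list of (N, 2) arrays of x/y points in metres (polylines)
    agents: list of (x, y, half_size) tuples; boxes are axis-aligned here,
            a simplification made for this sketch only
    Returns a float array of shape (2, size, size):
    channel 0 holds lanes, channel 1 holds agents.
    """
    grid = np.zeros((2, size, size), dtype=np.float32)
    scale = size / (2.0 * extent)  # pixels per metre

    def to_pixel(xy):
        # Map metric coordinates in [-extent, extent) to (col, row) indices.
        return np.floor((np.asarray(xy) + extent) * scale).astype(int)

    for polyline in lanes:
        polyline = np.asarray(polyline, dtype=np.float32)
        for start, end in zip(polyline[:-1], polyline[1:]):
            # Sample at roughly half-pixel spacing so no pixel is skipped.
            steps = int(np.ceil(np.linalg.norm(end - start) * scale)) * 2 + 1
            ts = np.linspace(0.0, 1.0, steps)[:, None]
            cols, rows = to_pixel(start + ts * (end - start)).T
            keep = (rows >= 0) & (rows < size) & (cols >= 0) & (cols < size)
            grid[0, rows[keep], cols[keep]] = 1.0

    for x, y, half in agents:
        c0, r0 = to_pixel((x - half, y - half))
        c1, r1 = to_pixel((x + half, y + half))
        # Clip the box to the grid before filling it.
        r0, r1 = np.clip([r0, r1 + 1], 0, size)
        c0, c1 = np.clip([c0, c1 + 1], 0, size)
        grid[1, r0:r1, c0:c1] = 1.0
    return grid


def mask_agents(grid):
    """Return a copy with the agent channel zeroed and the lane channel kept."""
    masked = grid.copy()
    masked[1] = 0.0
    return masked


# One straight lane along y = 0 and one agent centred on it.
scene = rasterize_scene(
    lanes=[np.array([[-20.0, 0.0], [20.0, 0.0]])],
    agents=[(5.0, 0.0, 1.0)],
)
lanes_only = mask_agents(scene)
```

Because lanes and agents occupy separate channels, the agent channel can be blanked while the lane channel stays intact. In this toy setting, that is the input a lane-conditioned agent generator would be asked to fill in, while combined generation would produce both channels together.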
