SIMSplat: Predictive Driving Scene Editing with Language-aligned 4D Gaussian Splatting

Sung-Yeon Park; Adam Lee; Juanwu Lu; Can Cui; Luyang Jiang; Rohit Gupta; Kyungtae Han; Ahmadreza Moradipari; Ziran Wang

SIMSplat: Predictive Driving Scene Editing with Language-aligned 4D Gaussian Splatting

Sung-Yeon Park, Adam Lee, Juanwu Lu, Can Cui, Luyang Jiang, Rohit Gupta, Kyungtae Han, Ahmadreza Moradipari, Ziran Wang

TL;DR

SIMSplat tackles the challenge of editable, realistic driving-scene generation by unifying language-grounded querying with a scene-graph based 4D Gaussian Splat representation. It introduces a four-stage pipeline: scene reconstruction with 4D Gaussians, language-grounding of appearance and motion, an LLM-driven editor, and a multi-agent path refinement to ensure globally coherent interactions. On the Waymo Open Dataset, it achieves state-of-the-art performance in road-object querying, the highest task-completion rate for editing prompts, and the lowest collision/failure rates due to predictive refinement. This approach enables intuitive natural-language editing of dynamic traffic scenes, including pedestrians, and offers a scalable foundation for realistic scenario generation in autonomous driving research.

Abstract

Driving scene manipulation with sensor data is emerging as a promising alternative to traditional virtual driving simulators. However, existing frameworks struggle to generate realistic scenarios efficiently due to limited editing capabilities. To address these challenges, we present SIMSplat, a predictive driving scene editor with language-aligned Gaussian splatting. As a language-controlled editor, SIMSplat enables intuitive manipulation using natural language prompts. By aligning language with Gaussian-reconstructed scenes, it further supports direct querying of road objects, allowing precise and flexible editing. Our method provides detailed object-level editing, including adding new objects and modifying the trajectories of both vehicles and pedestrians, while also incorporating predictive path refinement through multi-agent motion prediction to generate realistic interactions among all agents in the scene. Experiments on the Waymo dataset demonstrate SIMSplat's extensive editing capabilities and adaptability across a wide range of scenarios. Project page: https://sungyeonparkk.github.io/simsplat/

SIMSplat: Predictive Driving Scene Editing with Language-aligned 4D Gaussian Splatting

TL;DR

Abstract

SIMSplat: Predictive Driving Scene Editing with Language-aligned 4D Gaussian Splatting

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (6)