VDG: Vision-Only Dynamic Gaussian for Driving Simulation

Hao Li, Jingfeng Li, Dingwen Zhang, Chenming Wu, Jieqi Shi, Chen Zhao, Haocheng Feng, Errui Ding, Jingdong Wang, Junwei Han

TL;DR

This work tackles scalable driving simulation without pose priors by introducing Vision-only Dynamic Gaussian (VDG), which leverages self-supervised visual odometry for pose and dense monocular depth alongside a 3D Gaussian Splatting representation. It couples RGB/depth supervision with motion-mask guidance to decompose scenes into static and dynamic components and refines camera poses during training. The method demonstrates strong performance on Waymo and KITTI in both dynamic view synthesis and pose estimation, outperforming pose-free baselines and approaching GT-pose methods while enabling RGB-only input. Overall, VDG offers a practical, faster, and scalable pathway for driving simulation that bypasses LiDAR and precomputed poses, broadening accessibility for sim-to-real research and large-scale urban scenarios.

Abstract

Dynamic Gaussian splatting has led to impressive advances in scene reconstruction and novel-view image synthesis. Existing methods, however, rely heavily on poses and Gaussian initializations pre-computed by Structure-from-Motion (SfM) algorithms or obtained from expensive sensors. For the first time, this paper addresses this issue by integrating self-supervised visual odometry (VO) into our pose-free dynamic Gaussian method (VDG) to bootstrap pose and depth initialization and static-dynamic decomposition. Moreover, VDG works with RGB image input alone and reconstructs dynamic scenes faster and at larger scale than existing pose-free dynamic view-synthesis methods. We demonstrate the robustness of our approach via extensive quantitative and qualitative experiments. Our results show favorable performance over state-of-the-art dynamic view synthesis methods. Additional video and source code will be posted on our project page at https://3d-aigc.github.io/VDG.
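The initialization step described above, unprojecting dense monocular depth with VO-estimated poses to seed the Gaussians, is the crux of working without SfM or LiDAR. Below is a minimal PyTorch sketch of that lifting step; it is an illustration rather than the authors' implementation, and the intrinsics `K`, the camera-to-world pose convention, and the helper name `unproject_depth` are assumptions.

```python
import torch

def unproject_depth(depth, K, T_c2w):
    """Lift a depth map into world-space 3D points.

    depth : (H, W) per-pixel depth from a monocular depth network.
    K     : (3, 3) camera intrinsics (assumed known).
    T_c2w : (4, 4) camera-to-world pose from the VO network.
    """
    H, W = depth.shape
    v, u = torch.meshgrid(torch.arange(H), torch.arange(W), indexing="ij")
    pix = torch.stack([u, v, torch.ones_like(u)], dim=-1).float()  # (H, W, 3)
    # Back-project pixels into camera space: X_cam = depth * K^{-1} [u, v, 1]^T
    cam = (pix @ torch.linalg.inv(K).T) * depth.unsqueeze(-1)
    # Move camera-space points to world space with the VO pose.
    cam_h = torch.cat([cam, torch.ones_like(depth).unsqueeze(-1)], dim=-1)
    world = cam_h.reshape(-1, 4) @ T_c2w.T
    return world[:, :3]  # (H*W, 3) candidate Gaussian means
```

Each lifted point would then seed a Gaussian $G^k_t$ with zero initial velocity, matching the description in the Figure 2 caption below.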

Paper Structure

This paper contains 18 sections, 4 equations, 9 figures, and 3 tables.

Figures (9)

  • Figure 1: Our proposed VDG is crafted to effectively and uniformly reconstruct large, dynamic urban scenes, as well as to predict poses, from image input alone. Here, we showcase our reconstruction results and pose evaluation on the KITTI [geiger2012we] and Waymo [waymo_open_dataset] datasets, and further compare with the latest pose-free methods. The reconstructed visualizations reveal that our method can model static and dynamic objects without pose priors. Moreover, our method achieves much more accurate pose prediction than other pose-free methods.
  • Figure 2: The proposed VDG. (a) VDG Initialization: uses the off-the-shelf VO network $\mathcal{P}(\cdot)$, $\mathcal{M}(\cdot)$, and $\mathcal{D}(\cdot)$ to estimate the global poses $T_t$, motion masks $M_t$, and depth maps $D_t$ (see Sec. \ref{sec:vo}). Given poses $T_t$ and corresponding depth maps $D_t$, we project the depth maps into 3D space to initialize the Gaussian points $G^k_t = \{\tilde{\mu}^k_t, \Sigma^k, \widetilde{\alpha}^k_t, S^k\}$. Note that the velocity $v$ of each Gaussian is set to 0 (see Sec. \ref{sec:init}). (b) VDG Training Procedure: given the initialized Gaussians $G^k_t$, we train VDG using RGB and depth supervision (see Sec. \ref{sec:train}). Moreover, we apply motion-mask supervision to decompose static and dynamic scenes (Sec. \ref{sec:motion}); see the loss sketch after this figure list. In the end, we adopt a training strategy to refine the VO-given poses $T_t$ (Sec. \ref{sec:strategy}).
  • Figure 3: Illustration of Frozen Gaussian Parameters.
  • Figure 4: Qualitative comparison on the KITTI dataset regarding pose accuracy and rendering quality. Our method outperforms other baselines, even in cases where pose estimation is relatively poor.
  • Figure 5: Qualitative comparison of our approach against other baselines, including pose-free methods, methods requiring GT poses, and GT images, on the Waymo Open Dataset [waymo_open_dataset]. We show synthesis results for two example views in different scenes.
  • ...and 4 more figures
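To make the training supervision outlined in the Figure 2 caption concrete, here is a hedged sketch of a composite loss combining RGB reconstruction, depth consistency against the VO-predicted depth, and motion-mask guidance for the static/dynamic split. The weights `w_depth` and `w_motion`, the function name `vdg_loss`, and the assumption that the renderer produces a per-pixel dynamic-opacity map are all hypothetical; the paper's exact formulation may differ.

```python
import torch
import torch.nn.functional as F

def vdg_loss(pred_rgb, gt_rgb, pred_depth, vo_depth, pred_motion, motion_mask,
             w_depth=0.1, w_motion=0.05):
    """Composite supervision sketch: RGB reconstruction, VO-depth
    consistency, and motion-mask guidance for the static/dynamic split.

    pred_*      : per-pixel renders from the dynamic Gaussian model.
    vo_depth    : dense monocular depth used as a soft prior.
    motion_mask : per-pixel mask from the VO front end (1 = dynamic).
    The weights are placeholders, not values from the paper.
    """
    loss_rgb = F.l1_loss(pred_rgb, gt_rgb)
    loss_depth = F.l1_loss(pred_depth, vo_depth)
    # Encourage the rendered dynamic opacity to agree with the motion
    # mask, pushing moving content into the dynamic Gaussian branch.
    loss_motion = F.binary_cross_entropy(
        pred_motion.clamp(1e-5, 1 - 1e-5), motion_mask)
    return loss_rgb + w_depth * loss_depth + w_motion * loss_motion
```

In a training loop, this scalar would be backpropagated both through the Gaussian parameters and, per the pose-refinement strategy in the Figure 2 caption, through the VO-given camera poses $T_t$.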