ArmGS: Composite Gaussian Appearance Refinement for Modeling Dynamic Urban Environments
Guile Wu, Dongfeng Bai, Bingbing Liu
TL;DR
ArmGS introduces multi-level appearance refinement for composite 3D Gaussian Splatting to model dynamic urban driving scenes. By refining Gaussians at local, global, and dynamic-actor levels with learned affine transformations and a lightweight spatial-temporal deformation head, it captures fine-grained appearance changes across frames and viewpoints while preserving differentiability. Empirical results on Waymo, KITTI, NOTR, and VKITTI2 show superior reconstruction and novel-view synthesis, as well as real-time rendering, with comprehensive ablations validating each refinement component. The approach advances efficient, photorealistic, and dynamically consistent autonomous driving scene simulation, enabling more realistic validation and testing of driving systems.
Abstract
This work focuses on modeling dynamic urban environments for autonomous driving simulation. Contemporary data-driven methods using neural radiance fields have achieved photorealistic driving scene modeling, but they suffer from low rendering efficacy. Recently, some approaches have explored 3D Gaussian splatting for modeling dynamic urban scenes, enabling high-fidelity reconstruction and real-time rendering. However, these approaches often neglect to model fine-grained variations between frames and camera viewpoints, leading to suboptimal results. In this work, we propose a new approach named ArmGS that exploits composite driving Gaussian splatting with multi-granularity appearance refinement for autonomous driving scene modeling. The core idea of our approach is devising a multi-level appearance modeling scheme to optimize a set of transformation parameters for composite Gaussian refinement from multiple granularities, ranging from local Gaussian level to global image level and dynamic actor level. This not only models global scene appearance variations between frames and camera viewpoints, but also models local fine-grained changes of background and objects. Extensive experiments on multiple challenging autonomous driving datasets, namely, Waymo, KITTI, NOTR and VKITTI2, demonstrate the superiority of our approach over the state-of-the-art methods.
