ADD-SLAM: Adaptive Dynamic Dense SLAM with Gaussian Splatting
Wenhua Wu, Chenpeng Su, Siting Zhu, Tianchen Deng, Zhe Liu, Hesheng Wang
TL;DR
ADD-SLAM tackles dynamic scene challenges in dense SLAM by exploiting scene-consistency between observations and a historical Gaussian map to identify dynamics without semantic priors. It represents the scene with 3D Gaussian splats, maintaining a static map G_s and per-object dynamic maps G_d^{id}(t), and uses rendering-driven inconsistencies to drive adaptive dynamic segmentation and 2D object tracking. A dynamic-static composite mapping pipeline removes dynamic regions from the static map while online densifying static regions and constructing a per-object temporal Gaussian model for dynamics, all underpinned by a camera-tracking objective and loop-closure with DBA. Across Bonn, DAVIS, and TUM RGB-D, ADD-SLAM achieves state-of-the-art localization and rich dynamic object modeling, confirming its practical value for robotic perception in dynamic environments.
Abstract
Recent advancements in Neural Radiance Fields (NeRF) and 3D Gaussian-based Simultaneous Localization and Mapping (SLAM) methods have demonstrated exceptional localization precision and remarkable dense mapping performance. However, dynamic objects introduce critical challenges by disrupting scene consistency, leading to tracking drift and mapping artifacts. Existing methods that employ semantic segmentation or object detection for dynamic identification and filtering typically rely on predefined categorical priors, while discarding dynamic scene information crucial for robotic applications such as dynamic obstacle avoidance and environmental interaction. To overcome these challenges, we propose ADD-SLAM: an Adaptive Dynamic Dense SLAM framework based on Gaussian splitting. We design an adaptive dynamic identification mechanism grounded in scene consistency analysis, comparing geometric and textural discrepancies between real-time observations and historical maps. Ours requires no predefined semantic category priors and adaptively discovers scene dynamics. Precise dynamic object recognition effectively mitigates interference from moving targets during localization. Furthermore, we propose a dynamic-static separation mapping strategy that constructs a temporal Gaussian model to achieve online incremental dynamic modeling. Experiments conducted on multiple dynamic datasets demonstrate our method's flexible and accurate dynamic segmentation capabilities, along with state-of-the-art performance in both localization and mapping.
