DDN-SLAM: Real-time Dense Dynamic Neural Implicit SLAM
Mingrui Li, Yiming Zhou, Guangan Jiang, Tianchen Deng, Yangyang Wang, Hongyu Wang
TL;DR
DDN-SLAM addresses the core challenge of dynamic interference in neural implicit SLAM by integrating semantic understanding with a Gaussian Mixture depth prior to differentiate dynamic, static, and potentially static regions. It introduces a three-part pipeline: (i) semantic-guided segmentation with a depth-based two-Gaussian model and EM updates for robust foreground/background labeling, (ii) mixed background restoration combining optical-flow-based inpainting with sparse-point–guided sampling to preserve static structure, and (iii) dynamic NeRF rendering with a dedicated loss that enforces motion consistency and minimizes occlusion artifacts. The approach achieves real-time performance (~20 Hz) on monocular, stereo, and RGB-D inputs and demonstrates strong results across dynamic and challenging indoor scenes, including a reported average 90% improvement in ATE over prior neural implicit SLAM methods. By preserving potential dynamic objects and constraining dynamic occlusions, DDN-SLAM provides more complete, high-fidelity reconstructions suitable for robotics and AR/VR applications in dynamic environments.
Abstract
SLAM systems based on NeRF have demonstrated superior performance in rendering quality and scene reconstruction for static environments compared to traditional dense SLAM. However, they encounter tracking drift and mapping errors in real-world scenarios with dynamic interferences. To address these issues, we introduce DDN-SLAM, the first real-time dense dynamic neural implicit SLAM system integrating semantic features. To address dynamic tracking interferences, we propose a feature point segmentation method that combines semantic features with a mixed Gaussian distribution model. To avoid incorrect background removal, we propose a mapping strategy based on sparse point cloud sampling and background restoration. We propose a dynamic semantic loss to eliminate dynamic occlusions. Experimental results demonstrate that DDN-SLAM is capable of robustly tracking and producing high-quality reconstructions in dynamic environments, while appropriately preserving potential dynamic objects. Compared to existing neural implicit SLAM systems, the tracking results on dynamic datasets indicate an average 90% improvement in Average Trajectory Error (ATE) accuracy.
