NID-SLAM: Neural Implicit Representation-based RGB-D SLAM in dynamic environments
Ziheng Xu, Jianwei Niu, Qingfeng Li, Tao Ren, Chen Chen
TL;DR
NID-SLAM tackles dynamic-object interruptions in RGB-D SLAM by combining depth-guided semantic mask refinement, depth-based object removal, and background inpainting with a neural implicit scene representation. It introduces a depth revision mechanism (thresholded depth gradients), depth-aware mask refinement, and a dynamic-scene–oriented keyframe strategy to improve tracking robustness and mapping completeness. The method uses multiresolution feature grids with ray-based rendering and joint optimization of geometry, color, and camera poses, achieving state-of-the-art tracking accuracy among neural SLAMs on dynamic datasets and producing higher-quality maps than baselines, albeit with speed limited by segmentation. This work advances practical neural SLAM in dynamic environments and enables reusable static maps, with potential for real-time improvements through faster segmentation and predictive inpainting.
Abstract
Neural implicit representations have been explored to enhance visual SLAM algorithms, especially in providing high-fidelity dense map. Existing methods operate robustly in static scenes but struggle with the disruption caused by moving objects. In this paper we present NID-SLAM, which significantly improves the performance of neural SLAM in dynamic environments. We propose a new approach to enhance inaccurate regions in semantic masks, particularly in marginal areas. Utilizing the geometric information present in depth images, this method enables accurate removal of dynamic objects, thereby reducing the probability of camera drift. Additionally, we introduce a keyframe selection strategy for dynamic scenes, which enhances camera tracking robustness against large-scale objects and improves the efficiency of mapping. Experiments on publicly available RGB-D datasets demonstrate that our method outperforms competitive neural SLAM approaches in tracking accuracy and mapping quality in dynamic environments.
