MLP-SLAM: Multilayer Perceptron-Based Simultaneous Localization and Mapping
Taozhe Li, Wei Sun
TL;DR
This work addresses the challenge of SLAM performance degradation in dynamic outdoor environments by introducing an open-source MLP-based real-time stereo SLAM system that discriminates dynamic vs static feature points to preserve informative geometry. Built on ORB-SLAM2, the system adds a Depth Filter Module, a coarse pose estimation stage with object detection and tracking, an MLP-based discriminator using three error-based features, and a fine estimation stage that refines pose using static features. A new public dataset with over 50,000 labeled feature points enables direct evaluation of dynamic/static discrimination, and the method demonstrates superior average precision and faster speed on KITTI benchmarks compared with state-of-the-art dynamic SLAM methods. The authors also provide code and datasets on GitHub for reproducibility and broader usage.
Abstract
The Visual Simultaneous Localization and Mapping (V-SLAM) system has seen significant development in recent years, demonstrating high precision in environments with limited dynamic objects. However, their performance significantly deteriorates when deployed in settings with a higher presence of movable objects, such as environments with pedestrians, cars, and buses, which are common in outdoor scenes. To address this issue, we propose a Multilayer Perceptron (MLP)-based real-time stereo SLAM system that leverages complete geometry information to avoid information loss. Moreover, there is currently no publicly available dataset for directly evaluating the effectiveness of dynamic and static feature classification methods, and to bridge this gap, we have created a publicly available dataset containing over 50,000 feature points. Experimental results demonstrate that our MLP-based dynamic and static feature point discriminator has achieved superior performance compared to other methods on this dataset. Furthermore, the MLP-based real-time stereo SLAM system has shown the highest average precision and fastest speed on the outdoor KITTI tracking datasets compared to other dynamic SLAM systems.The open-source code and datasets are available at https://github.com/TaozheLi/MLP-SLAM.
