EC3R-SLAM: Efficient and Consistent Monocular Dense SLAM with Feed-Forward 3D Reconstruction

Lingxiang Hu; Naima Ait Oufroukh; Fabien Bonardi; Raymond Ghandour

EC3R-SLAM: Efficient and Consistent Monocular Dense SLAM with Feed-Forward 3D Reconstruction

Lingxiang Hu, Naima Ait Oufroukh, Fabien Bonardi, Raymond Ghandour

TL;DR

EC3R-SLAM addresses the bottlenecks of latency and GPU memory in monocular dense SLAM by delivering calibration-free operation through a tight coupling of lightweight tracking with a feed-forward 3D reconstruction backend. It introduces both local and global loop-closure mechanisms to enforce multi-view consistency while simultaneously estimating intrinsics, enabling real-time, memory-efficient dense mapping. The proposed approach achieves competitive accuracy on standard benchmarks (TUM-RGBD, 7-Scenes, Replica) with significantly lower VRAM usage and robust performance on resource-constrained hardware. The combination of local sparse tracking, submap-level feed-forward reconstruction, and comprehensive loop closures yields strong generalization and practical applicability to real-world robotics and AR/VR tasks.

Abstract

The application of monocular dense Simultaneous Localization and Mapping (SLAM) is often hindered by high latency, large GPU memory consumption, and reliance on camera calibration. To relax this constraint, we propose EC3R-SLAM, a novel calibration-free monocular dense SLAM framework that jointly achieves high localization and mapping accuracy, low latency, and low GPU memory consumption. This enables the framework to achieve efficiency through the coupling of a tracking module, which maintains a sparse map of feature points, and a mapping module based on a feed-forward 3D reconstruction model that simultaneously estimates camera intrinsics. In addition, both local and global loop closures are incorporated to ensure mid-term and long-term data association, enforcing multi-view consistency and thereby enhancing the overall accuracy and robustness of the system. Experiments across multiple benchmarks show that EC3R-SLAM achieves competitive performance compared to state-of-the-art methods, while being faster and more memory-efficient. Moreover, it runs effectively even on resource-constrained platforms such as laptops and Jetson Orin NX, highlighting its potential for real-world robotics applications.

EC3R-SLAM: Efficient and Consistent Monocular Dense SLAM with Feed-Forward 3D Reconstruction

TL;DR

Abstract

EC3R-SLAM: Efficient and Consistent Monocular Dense SLAM with Feed-Forward 3D Reconstruction

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (9)