DH-PTAM: A Deep Hybrid Stereo Events-Frames Parallel Tracking And Mapping System
Abanob Soliman, Fabien Bonardi, Désiré Sidibé, Samia Bouchafa
TL;DR
DH-PTAM addresses robust SLAM in challenging HDR and dynamic environments by fusing stereo frames with stereo event streams in a unified reference frame. It introduces spatio-temporal synchronization, a depth-aware spatial hybridization, and an end-to-end optimization backbone with learning-based descriptors and a lightweight loop-closure mechanism. Experimental results on the VECtor and TUM-VIE benchmarks show improved accuracy and robustness over RGB-only and some event-based baselines, particularly in HDR scenarios, with GPU-enabled front-ends offering further gains. The work provides a practical, scalable DH-PTAM implementation and outlines directions for online synchronization optimization and adaptive temporal-windowing to enhance real-time performance.
Abstract
This paper presents a robust approach for a visual parallel tracking and mapping (PTAM) system that excels in challenging environments. Our proposed method combines the strengths of heterogeneous multi-modal visual sensors, including stereo event-based and frame-based sensors, in a unified reference frame through a novel spatio-temporal synchronization of stereo visual frames and stereo event streams. We employ deep learning-based feature extraction and description for estimation to enhance robustness further. We also introduce an end-to-end parallel tracking and mapping optimization layer complemented by a simple loop-closure algorithm for efficient SLAM behavior. Through comprehensive experiments on both small-scale and large-scale real-world sequences of VECtor and TUM-VIE benchmarks, our proposed method (DH-PTAM) demonstrates superior performance in terms of robustness and accuracy in adverse conditions, especially in large-scale HDR scenarios. Our implementation's research-based Python API is publicly available on GitHub for further research and development: https://github.com/AbanobSoliman/DH-PTAM.
