IFTD: Image Feature Triangle Descriptor for Loop Detection in Driving Scenes
Fengtian Lang, Ruiye Ming, Zikang Yuan, Xin Yang
TL;DR
The paper tackles robust, real-time loop detection for driving scenes using LiDAR data. It introduces IFTD, a BEV-based Image Feature Triangle Descriptor built from Shi-Tomasi points on BEV projections, enabling rotation- and translation-invariant matching and a $4$-DOF pose estimation between keyframes. A two-stage verification combines a rapid hash-voxel candidate search over triangle sides with a BEV image-similarity check and SVD-based pose estimation under RANSAC to confirm loops. Experiments on KITTI, Mulran, and NCLT show IFTD outperforms state-of-the-art methods (STD and Contour Context) in accuracy and robustness while achieving about a $50 imes$ faster runtime than STD, highlighting its suitability for real-time autonomous driving applications. This approach demonstrates that BEV-derived triangle descriptors can provide strong geometric cues with low overhead for scalable loop closure in complex environments, and the authors release the code to foster community development.
Abstract
In this work, we propose a fast and robust Image Feature Triangle Descriptor (IFTD) based on the STD method, aimed at improving the efficiency and accuracy of place recognition in driving scenarios. We extract keypoints from BEV projection image of point cloud and construct these keypoints into triangle descriptors. By matching these feature triangles, we achieved precise place recognition and calculated the 4-DOF pose estimation between two keyframes. Furthermore, we employ image similarity inspection to perform the final place recognition. Experimental results on three public datasets demonstrate that our IFTD can achieve greater robustness and accuracy than state-of-the-art methods with low computational overhead.
