Speak the Same Language: Global LiDAR Registration on BIM Using Pose Hough Transform
Zhijian Qiao, Haoming Huang, Chuhao Liu, Zehuan Yu, Shaojie Shen, Fumin Zhang, Huan Yin
TL;DR
The paper addresses the problem of aligning LiDAR scans to BIM models in a shared reference frame, enabling cross-modal, global registration without external infrastructure. It introduces a front-end that extracts walls and corners to form triangle descriptors with $O(1)$ retrieval, and a back-end that uses a Pose Hough Transform on $SE(2)$ with hierarchical voting to generate multiple pose candidates. The optimal transformation is selected via an occupancy-aware verification score that compare BIM and LiDAR occupancy, improving robustness to as-built versus as-designed deviations. Real-world experiments in a large university building with two LiDAR sensors demonstrate the method's accuracy and efficiency, and the authors provide open-source code and expanded datasets to facilitate further research.
Abstract
Light detection and ranging (LiDAR) point clouds and building information modeling (BIM) represent two distinct data modalities in the fields of robot perception and construction. These modalities originate from different sources and are associated with unique reference frames. The primary goal of this study is to align these modalities within a shared reference frame using a global registration approach, effectively enabling them to ``speak the same language''. To achieve this, we propose a cross-modality registration method, spanning from the front end to the back end. At the front end, we extract triangle descriptors by identifying walls and intersected corners, enabling the matching of corner triplets with a complexity independent of the BIM's size. For the back-end transformation estimation, we utilize the Hough transform to map the matched triplets to the transformation space and introduce a hierarchical voting mechanism to hypothesize multiple pose candidates. The final transformation is then verified using our designed occupancy-aware scoring method. To assess the effectiveness of our approach, we conducted real-world multi-session experiments in a large-scale university building, employing two different types of LiDAR sensors. We make the collected datasets and codes publicly available to benefit the community.
