Unified Few-shot Crack Segmentation and its Precise 3D Automatic Measurement in Concrete Structures
Pengru Deng, Jiapeng Yao, Chun Li, Su Wang, Xinrun Li, Varun Ojha, Xuhui He, Takashi Matsumoto
TL;DR
This paper addresses the challenge of robust, generalized concrete crack inspection across diverse environments by combining a few-shot crack segmentation approach with a foundation-model–driven refinement and a LiDAR–camera–IMU multi-sensor SLAM framework. It introduces a four-module system: calibrated multi-sensor data acquisition, 2D crack segmentation refined by SAM prompts, dense 3D crack reconstruction with MLS/SOR denoising, and automatic 3D crack width and localization measurements within the colored point cloud. The key contributions include a generalizable crack segmentation workflow leveraging SAM, a dense multi-frame multi-modal 3D reconstruction pipeline, and an automated 3D crack measurement method validated on field data with submillimeter accuracy and competitive reconstruction quality. The framework promises practical impact for on-site inspection and digital twin applications by delivering accurate, automated crack metrics directly in 3D space, under varied geometries and conditions.
Abstract
Visual-Spatial Systems has become increasingly essential in concrete crack inspection. However, existing methods often lacks adaptability to diverse scenarios, exhibits limited robustness in image-based approaches, and struggles with curved or complex geometries. To address these limitations, an innovative framework for two-dimensional (2D) crack detection, three-dimensional (3D) reconstruction, and 3D automatic crack measurement was proposed by integrating computer vision technologies and multi-modal Simultaneous localization and mapping (SLAM) in this study. Firstly, building on a base DeepLabv3+ segmentation model, and incorporating specific refinements utilizing foundation model Segment Anything Model (SAM), we developed a crack segmentation method with strong generalization across unfamiliar scenarios, enabling the generation of precise 2D crack masks. To enhance the accuracy and robustness of 3D reconstruction, Light Detection and Ranging (LiDAR) point clouds were utilized together with image data and segmentation masks. By leveraging both image- and LiDAR-SLAM, we developed a multi-frame and multi-modal fusion framework that produces dense, colorized point clouds, effectively capturing crack semantics at a 3D real-world scale. Furthermore, the crack geometric attributions were measured automatically and directly within 3D dense point cloud space, surpassing the limitations of conventional 2D image-based measurements. This advancement makes the method suitable for structural components with curved and complex 3D geometries. Experimental results across various concrete structures highlight the significant improvements and unique advantages of the proposed method, demonstrating its effectiveness, accuracy, and robustness in real-world applications.
