Table of Contents
Fetching ...

Weakly-Supervised Surface Crack Segmentation by Generating Pseudo-Labels using Localization with a Classifier and Thresholding

Jacob König, Mark Jenkins, Mike Mannion, Peter Barrie, Gordon Morison

TL;DR

This work proposes a weakly supervised approach that leverages a CNN classifier in a novel way to create surface crack pseudo labels that achieve comparable performance to fully supervised methods on four popular crack segmentation datasets.

Abstract

Surface cracks are a common sight on public infrastructure nowadays. Recent work has been addressing this problem by supporting structural maintenance measures using machine learning methods. Those methods are used to segment surface cracks from their background, making them easier to localize. However, a common issue is that to create a well-functioning algorithm, the training data needs to have detailed annotations of pixels that belong to cracks. Our work proposes a weakly supervised approach that leverages a CNN classifier in a novel way to create surface crack pseudo labels. First, we use the classifier to create a rough crack localization map by using its class activation maps and a patch based classification approach and fuse this with a thresholding based approach to segment the mostly darker crack pixels. The classifier assists in suppressing noise from the background regions, which commonly are incorrectly highlighted as cracks by standard thresholding methods. Then, the pseudo labels can be used in an end-to-end approach when training a standard CNN for surface crack segmentation. Our method is shown to yield sufficiently accurate pseudo labels. Those labels, incorporated into segmentation CNN training using multiple recent crack segmentation architectures, achieve comparable performance to fully supervised methods on four popular crack segmentation datasets.

Weakly-Supervised Surface Crack Segmentation by Generating Pseudo-Labels using Localization with a Classifier and Thresholding

TL;DR

This work proposes a weakly supervised approach that leverages a CNN classifier in a novel way to create surface crack pseudo labels that achieve comparable performance to fully supervised methods on four popular crack segmentation datasets.

Abstract

Surface cracks are a common sight on public infrastructure nowadays. Recent work has been addressing this problem by supporting structural maintenance measures using machine learning methods. Those methods are used to segment surface cracks from their background, making them easier to localize. However, a common issue is that to create a well-functioning algorithm, the training data needs to have detailed annotations of pixels that belong to cracks. Our work proposes a weakly supervised approach that leverages a CNN classifier in a novel way to create surface crack pseudo labels. First, we use the classifier to create a rough crack localization map by using its class activation maps and a patch based classification approach and fuse this with a thresholding based approach to segment the mostly darker crack pixels. The classifier assists in suppressing noise from the background regions, which commonly are incorrectly highlighted as cracks by standard thresholding methods. Then, the pseudo labels can be used in an end-to-end approach when training a standard CNN for surface crack segmentation. Our method is shown to yield sufficiently accurate pseudo labels. Those labels, incorporated into segmentation CNN training using multiple recent crack segmentation architectures, achieve comparable performance to fully supervised methods on four popular crack segmentation datasets.

Paper Structure

This paper contains 26 sections, 11 equations, 8 figures, 9 tables.

Figures (8)

  • Figure 1: Overview of our proposed, weakly-supervised approach to create accurate segmentation maps when only classification labels are available.
  • Figure 2: Interim outputs of the steps to create the coarse localization map using the classifier.
  • Figure 3: Samples of performing different thresholding methods on a crack-image. Multi Otsu uses three classes. The Niblack and Sauvola methods use a window size of 33. Patches-Ours: Our local patch approach in which each patch is thresholded with a window size of 32 and a stride of 8 followed by combining the outputs.
  • Figure 4: Example output that has been generated by merging the localization map and the thresholded segmentation.
  • Figure 5: Comparison of the label quality with the ground truth of our proposed method and the CRF based method from dong2020PatchBasedWeakly.
  • ...and 3 more figures