Evaluation of Data Augmentation and Loss Functions in Semantic Image Segmentation for Drilling Tool Wear Detection
Elke Schlager, Andreas Windisch, Lukas Hanna, Thomas Klünsner, Elias Jan Hagendorfer, Tamara Teppernegg
TL;DR
The paper tackles direct wear detection in drilling tools via a U-Net segmentation pipeline applied to microscopy images. It systematically evaluates binary versus multiclass wear definitions, three loss functions ($CE$, $FCE$, and an $IoU$-based loss), two tile sizes, and three augmentation regimes, using 5-fold cross-validation across 72 configurations. Key findings show that binary models with batch normalization and an $IoU$-based loss trained with moderately augmented data achieve strong performance, with final $IoU$ around $0.886$ for 512 px tiles and $0.904$ for 256 px tiles; multiclass configurations can match or exceed binary performance in some settings, while full augmentation can reduce accuracy. The approach enables robust wear-area segmentation and the extraction of wear metrics directly from segmentation masks, with larger tiles offering resilience to reflections, highlighting practical value for automated tool wear monitoring and process optimization.
Abstract
Tool wear monitoring is crucial for quality control and cost reduction in manufacturing processes, of which drilling applications are one example. In this paper, we present a U-Net based semantic image segmentation pipeline, deployed on microscopy images of cutting inserts, for the purpose of wear detection. The wear area is differentiated in two different types, resulting in a multiclass classification problem. Joining the two wear types in one general wear class, on the other hand, allows the problem to be formulated as a binary classification task. Apart from the comparison of the binary and multiclass problem, also different loss functions, i. e., Cross Entropy, Focal Cross Entropy, and a loss based on the Intersection over Union (IoU), are investigated. Furthermore, models are trained on image tiles of different sizes, and augmentation techniques of varying intensities are deployed. We find, that the best performing models are binary models, trained on data with moderate augmentation and an IoU-based loss function.
