Object detection characteristics in a learning factory environment using YOLOv8
Toni Schneidereit, Stefan Gohrenz, Michael Breuß
TL;DR
This study addresses industrial object detection in a learning factory by systematically varying object appearance across 15 workpieces and training 92 YOLOv8 models (in both nano and extra-large variants) to assess background and material influences on detection. It combines single-class and multi-class training with Layer-wise Relevance Propagation (LRP) heatmaps to probe explainability and potential overfitting, using $mAP$ and $IoU$-based metrics for evaluation. Key findings show that some materials (e.g., yellow/white plastics, light wood) are detected reliably, while others (aluminium, steel, black, red plastics) are challenging, and background/reflective cues can mislead predictions. The work provides a challenging, well-documented dataset and insights that inform dataset design, model explainability, and the development of robust industrial detectors for complex factory environments.
Abstract
AI-based object detection, and efforts to explain and investigate their characteristics, is a topic of high interest. The impact of, e.g., complex background structures with similar appearances as the objects of interest, on the detection accuracy and, beforehand, the necessary dataset composition are topics of ongoing research. In this paper, we present a systematic investigation of background influences and different features of the object to be detected. The latter includes various materials and surfaces, partially transparent and with shiny reflections in the context of an Industry 4.0 learning factory. Different YOLOv8 models have been trained for each of the materials on different sized datasets, where the appearance was the only changing parameter. In the end, similar characteristics tend to show different behaviours and sometimes unexpected results. While some background components tend to be detected, others with the same features are not part of the detection. Additionally, some more precise conclusions can be drawn from the results. Therefore, we contribute a challenging dataset with detailed investigations on 92 trained YOLO models, addressing some issues on the detection accuracy and possible overfitting.
