XAI-Enhanced Semantic Segmentation Models for Visual Quality Inspection
Tobias Clement, Truong Thanh Hung Nguyen, Mohamed Abdelaal, Hung Cao
TL;DR
The paper tackles the interpretability gap in visual quality inspection by introducing a CAM-based Explainable AI framework to refine semantic segmentation models, notably DeepLabv3-ResNet101, for more trustworthy defect detection. It details a four-stage workflow—training, CAM-based explanations, XAI evaluation, and annotation augmentation guided by explanations—applied to the TTPLA dataset in a cloud/mobile VQI setting. Through systematic evaluation, HiResCAM emerges as the most faithful and efficient XAI method, guiding an annotation-augmentation strategy that yields measurable IoU gains, particularly for challenging objects like cables. The results demonstrate that XAI-driven enhancements can improve segmentation quality while maintaining usability, paving the way for more transparent, robust VQI systems in manufacturing contexts.
Abstract
Visual quality inspection systems, crucial in sectors like manufacturing and logistics, employ computer vision and machine learning for precise, rapid defect detection. However, their unexplained nature can hinder trust, error identification, and system improvement. This paper presents a framework to bolster visual quality inspection by using CAM-based explanations to refine semantic segmentation models. Our approach consists of 1) Model Training, 2) XAI-based Model Explanation, 3) XAI Evaluation, and 4) Annotation Augmentation for Model Enhancement, informed by explanations and expert insights. Evaluations show XAI-enhanced models surpass original DeepLabv3-ResNet101 models, especially in intricate object segmentation.
