CSAD: Unsupervised Component Segmentation for Logical Anomaly Detection
Yu-Hsuan Hsieh, Shang-Hong Lai
TL;DR
CSAD introduces unsupervised semantic pseudo-label generation to train a lightweight component segmentation network for logical anomaly detection. It combines a Patch Histogram module with a Local-Global Student-Teacher (LGST) framework to detect both position/quantity-based and scale-based anomalies, achieving a total AUROC of 95.3% on MVTec LOCO AD with low latency (8.9 ms) and high throughput (321.8 fps). Semantic pseudo-labels are produced without human annotations via RAM++ and Grounded-SAM, followed by segmentation training with LSA augmentation and multiple losses. The method demonstrates strong practical impact by reducing labeling requirements while delivering state-of-the-art performance and efficiency in industrial anomaly detection tasks.
Abstract
To improve logical anomaly detection, some previous works have integrated segmentation techniques with conventional anomaly detection methods. Although these methods are effective, they frequently lead to unsatisfactory segmentation results and require manual annotations. To address these drawbacks, we develop an unsupervised component segmentation technique that leverages foundation models to autonomously generate training labels for a lightweight segmentation network without human labeling. Integrating this new segmentation technique with our proposed Patch Histogram module and the Local-Global Student-Teacher (LGST) module, we achieve a detection AUROC of 95.3% in the MVTec LOCO AD dataset, which surpasses previous SOTA methods. Furthermore, our proposed method provides lower latency and higher throughput than most existing approaches.
