SuperSimpleNet: Unifying Unsupervised and Supervised Learning for Fast and Reliable Surface Defect Detection
Blaž Rolih, Matic Fučka, Danijel Skočaj
TL;DR
Surface defect detection requires high accuracy, robustness, and fast operation while leveraging all available data. SuperSimpleNet unifies unsupervised and supervised learning by extending SimpleNet with a feature-space anomaly generator, an upscaling-capable feature extractor, a feature adaptor, and a segmentation-detection pipeline that includes a global classification head. It achieves state-of-the-art results on both supervised (SensumSODF, KSDD2) and unsupervised (MVTec AD, VisA) benchmarks, while maintaining a fast inference time of about 9.3 ms and 268 images per second. The approach demonstrates strong robustness and stability across training runs and data regimes, with ablation analyses clarifying the contribution of each architectural component and training strategy to performance and efficiency.
Abstract
The aim of surface defect detection is to identify and localise abnormal regions on the surfaces of captured objects, a task that's increasingly demanded across various industries. Current approaches frequently fail to fulfil the extensive demands of these industries, which encompass high performance, consistency, and fast operation, along with the capacity to leverage the entirety of the available training data. Addressing these gaps, we introduce SuperSimpleNet, an innovative discriminative model that evolved from SimpleNet. This advanced model significantly enhances its predecessor's training consistency, inference time, as well as detection performance. SuperSimpleNet operates in an unsupervised manner using only normal training images but also benefits from labelled abnormal training images when they are available. SuperSimpleNet achieves state-of-the-art results in both the supervised and the unsupervised settings, as demonstrated by experiments across four challenging benchmark datasets. Code: https://github.com/blaz-r/SuperSimpleNet .
