EfficientAD: Accurate Visual Anomaly Detection at Millisecond-Level Latencies
Kilian Batzner, Lars Heckler, Rebecca König
TL;DR
EfficientAD tackles real-time visual anomaly detection under strict latency constraints by combining a fast patch-based feature extractor (PDN) with a lightweight student–teacher detector and an autoencoder to capture logical constraints. It introduces a loss-driven asymmetry (hard feature loss) and a pretraining penalty to boost detection without increasing test-time cost, and it employs a calibrated fusion of local (student–teacher) and global (autoencoder) anomaly maps. Evaluated on 32 industrial datasets, EfficientAD achieves millisecond-scale latency and high throughput while delivering state-of-the-art detection and localization performance. This makes it a practical baseline for real-world manufacturing QA and a strong foundation for further research into efficient, explainable anomaly detection.
Abstract
Detecting anomalies in images is an important task, especially in real-time computer vision applications. In this work, we focus on computational efficiency and propose a lightweight feature extractor that processes an image in less than a millisecond on a modern GPU. We then use a student-teacher approach to detect anomalous features. We train a student network to predict the extracted features of normal, i.e., anomaly-free training images. The detection of anomalies at test time is enabled by the student failing to predict their features. We propose a training loss that hinders the student from imitating the teacher feature extractor beyond the normal images. It allows us to drastically reduce the computational cost of the student-teacher model, while improving the detection of anomalous features. We furthermore address the detection of challenging logical anomalies that involve invalid combinations of normal local features, for example, a wrong ordering of objects. We detect these anomalies by efficiently incorporating an autoencoder that analyzes images globally. We evaluate our method, called EfficientAD, on 32 datasets from three industrial anomaly detection dataset collections. EfficientAD sets new standards for both the detection and the localization of anomalies. At a latency of two milliseconds and a throughput of six hundred images per second, it enables a fast handling of anomalies. Together with its low error rate, this makes it an economical solution for real-world applications and a fruitful basis for future research.
