Feature Attenuation of Defective Representation Can Resolve Incomplete Masking on Anomaly Detection
YeongHyeon Park, Sungho Kang, Myung Jin Kim, Hyeong Seok Kim, Juneho Yi
TL;DR
This work addresses incomplete masking in reconstruction-by-inpainting-based unsupervised anomaly detection (UAD) by introducing FADeR, a lightweight two-layer MLP that attenuates defective representations through soft masking on skip connections. Trained with an active-learning strategy that derives patch-wise attenuation signals from reconstruction errors, FADeR does not require labeled defect data. Integrated with a pre-trained UAD model using a single deterministic mask, FADeR improves image- and pixel-level AUROC on MVTec AD and VisA and demonstrates plug-and-play compatibility with other masking approaches. The results show that a compact, edge-friendly module can significantly enhance UAD performance without large-scale architectural changes, enabling practical deployment in industrial settings.
Abstract
In unsupervised anomaly detection (UAD) research, while state-of-the-art models have reached a saturation point with extensive studies on public benchmark datasets, they adopt large-scale tailor-made neural networks (NN) for detection performance or pursued unified models for various tasks. Towards edge computing, it is necessary to develop a computationally efficient and scalable solution that avoids large-scale complex NNs. Motivated by this, we aim to optimize the UAD performance with minimal changes to NN settings. Thus, we revisit the reconstruction-by-inpainting approach and rethink to improve it by analyzing strengths and weaknesses. The strength of the SOTA methods is a single deterministic masking approach that addresses the challenges of random multiple masking that is inference latency and output inconsistency. Nevertheless, the issue of failure to provide a mask to completely cover anomalous regions is a remaining weakness. To mitigate this issue, we propose Feature Attenuation of Defective Representation (FADeR) that only employs two MLP layers which attenuates feature information of anomaly reconstruction during decoding. By leveraging FADeR, features of unseen anomaly patterns are reconstructed into seen normal patterns, reducing false alarms. Experimental results demonstrate that FADeR achieves enhanced performance compared to similar-scale NNs. Furthermore, our approach exhibits scalability in performance enhancement when integrated with other single deterministic masking methods in a plug-and-play manner.
