Toward Multi-class Anomaly Detection: Exploring Class-aware Unified Model against Inter-class Interference
Xi Jiang, Ying Chen, Qiang Nie, Jianlin Liu, Yong Liu, Chengjie Wang, Feng Zheng
TL;DR
The paper addresses multi-class anomaly detection by mitigating inter-class interference in unified models. It introduces $MINT$-$AD$, a transformer-based reconstruction framework that employs a class-aware implicit neural representation to generate per-class queries, supervised by $L_{CE}$ and a prior distribution loss $L_{Prior}$ alongside the reconstruction loss $L_{MSE}$. The approach achieves state-of-the-art or competitive results on MVTec-AD, VisA, CIFAR-10, and a larger unified dataset, with improved anomaly localization and robustness to background noise. These results demonstrate the feasibility of leveraging category information during training with INR to enable scalable, robust multi-class anomaly detection for industrial inspection.
Abstract
In the context of high usability in single-class anomaly detection models, recent academic research has become concerned about the more complex multi-class anomaly detection. Although several papers have designed unified models for this task, they often overlook the utility of class labels, a potent tool for mitigating inter-class interference. To address this issue, we introduce a Multi-class Implicit Neural representation Transformer for unified Anomaly Detection (MINT-AD), which leverages the fine-grained category information in the training stage. By learning the multi-class distributions, the model generates class-aware query embeddings for the transformer decoder, mitigating inter-class interference within the reconstruction model. Utilizing such an implicit neural representation network, MINT-AD can project category and position information into a feature embedding space, further supervised by classification and prior probability loss functions. Experimental results on multiple datasets demonstrate that MINT-AD outperforms existing unified training models.
