Towards Zero-shot 3D Anomaly Localization
Yizhou Wang, Kuan-Chuan Peng, Yun Fu
TL;DR
This work tackles zero-shot 3D anomaly localization, where the target class lacks normal training data. It introduces 3DzAL, a patch-level contrastive framework that uses pseudo anomalies generated from task-irrelevant 3D data, a memory-bank with RGB, FPFH, and learnable 3D features, and a 3D normalcy classifier augmented by adversarial perturbations to produce robust anomaly scores. A key insight is that a randomly initialized CNN exhibits an inductive bias that localizes regions of interest in 3D XYZ data, enabling effective pseudo anomaly synthesis without pre-trained 3D models. Across 90 zero-shot trials on the MVTec 3D-AD dataset, 3DzAL achieves state-of-the-art pixel-level and image-level metrics, demonstrating strong generalization to unseen classes and validating the utility of task-irrelevant data and input perturbations for zero-shot 3D anomaly detection and localization.
Abstract
3D anomaly detection and localization is of great significance for industrial inspection. Prior 3D anomaly detection and localization methods focus on the setting that the testing data share the same category as the training data which is normal. However, in real-world applications, the normal training data for the target 3D objects can be unavailable due to issues like data privacy or export control regulation. To tackle these challenges, we identify a new task -- zero-shot 3D anomaly detection and localization, where the training and testing classes do not overlap. To this end, we design 3DzAL, a novel patch-level contrastive learning framework based on pseudo anomalies generated using the inductive bias from task-irrelevant 3D xyz data to learn more representative feature representations. Furthermore, we train a normalcy classifier network to classify the normal patches and pseudo anomalies and utilize the classification result jointly with feature distance to design anomaly scores. Instead of directly using the patch point clouds, we introduce adversarial perturbations to the input patch xyz data before feeding into the 3D normalcy classifier for the classification-based anomaly score. We show that 3DzAL outperforms the state-of-the-art anomaly detection and localization performance.
