SplatPose & Detect: Pose-Agnostic 3D Anomaly Detection
Mathis Kruse, Marco Rudolph, Dominik Woiwode, Bodo Rosenhahn
TL;DR
This work tackles 3D anomaly detection under varying object pose. Given multi-view images of an object, it represents the object as a cloud of 3D Gaussians (3D Gaussian splatting) and estimates the pose of an unseen query view through differentiable SE(3) transformations. With the pose recovered, the method renders a defect-free view at that pose and detects anomalies by comparing features of the query image against those of the rendering. On the MAD benchmark, SplatPose reports state-of-the-art detection performance together with faster training and inference than competing NeRF-based methods, and it stays competitive when trained on fewer images. These properties make it a candidate for fast, pose-robust 3D inspection in industrial settings.
Abstract
Detecting anomalies in images has become a well-explored problem in both academia and industry. State-of-the-art algorithms are able to detect defects in increasingly difficult settings and data modalities. However, most current methods are not suited to address 3D objects captured from differing poses. While solutions using Neural Radiance Fields (NeRFs) have been proposed, they suffer from excessive computation requirements, which hinder real-world usability. For this reason, we propose the novel 3D Gaussian splatting-based framework SplatPose which, given multi-view images of a 3D object, accurately estimates the pose of unseen views in a differentiable manner, and detects anomalies in them. We achieve state-of-the-art results in both training and inference speed, and detection performance, even when using less training data than competing methods. We thoroughly evaluate our framework using the recently proposed Pose-agnostic Anomaly Detection benchmark and its multi-pose anomaly detection (MAD) data set.
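The detection step summarized above, comparing a query view against a defect-free rendering at the estimated pose, can be illustrated with a minimal sketch. It assumes feature maps for both images are already available as arrays of shape (H, W, C). The function names `anomaly_map` and `image_score`, the squared L2 distance, and the max-pooled image score are illustrative assumptions, not the authors' implementation. Feature extraction, pose estimation, and rendering are not shown.

```python
import numpy as np


def anomaly_map(query_feats, reference_feats):
    """Per-pixel squared L2 distance between two (H, W, C) feature maps.

    Hypothetical helper: the distance measure is an assumption made for
    illustration, not taken from the paper.
    """
    if query_feats.shape != reference_feats.shape:
        raise ValueError("feature maps must have the same shape")
    diff = query_feats - reference_feats
    return np.sum(diff * diff, axis=-1)


def image_score(amap):
    """Image-level anomaly score as the maximum pixel score (an assumption)."""
    return float(amap.max())


# Toy data standing in for real features: the "query" equals the
# defect-free "reference" except for a small simulated defect region.
rng = np.random.default_rng(0)
reference = rng.normal(size=(8, 8, 4))
query = reference.copy()
query[2:4, 5:7, :] += 3.0  # simulated defect

amap = anomaly_map(query, reference)
score = image_score(amap)
```

In this toy setup the map is zero wherever the two feature maps agree and large inside the perturbed region, so thresholding `amap` localizes the defect while `score` gives an image-level decision value.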
