Odd-One-Out: Anomaly Detection by Comparing with Neighbors
Ankan Bhunia, Changjian Li, Hakan Bilen
TL;DR
This work introduces Odd-One-Out, a scene-specific anomaly detection framework that identifies oddly behaving objects by comparing multiple instances within the same scene using multi-view imagery. It builds a 3D object-centric feature volume from multiple views, enhances it via differentiable rendering and DINOv2 feature distillation, and performs cross-instance matching with sparse voxel attention to predict per-object anomalies and 3D locations. The approach is evaluated on two new benchmarks, ToysAD-8K and PartsAD-15K, and shows strong generalization, especially to unseen categories, outperforming reconstruction-based and multi-view baselines. The work advances practical AD by leveraging inter-object context and robust 3D representations, with potential impact on manufacturing quality control and related domains.
Abstract
This paper introduces a novel anomaly detection (AD) problem aimed at identifying `odd-looking' objects within a scene by comparing them to other objects present. Unlike traditional AD benchmarks with fixed anomaly criteria, our task detects anomalies specific to each scene by inferring a reference group of regular objects. To address occlusions, we use multiple views of each scene as input, construct 3D object-centric models for each instance from 2D views, enhancing these models with geometrically consistent part-aware representations. Anomalous objects are then detected through cross-instance comparison. We also introduce two new benchmarks, ToysAD-8K and PartsAD-15K as testbeds for future research in this task. We provide a comprehensive analysis of our method quantitatively and qualitatively on these benchmarks.
