Redefining Normal: A Novel Object-Level Approach for Multi-Object Novelty Detection
Mohammadreza Salehi, Nikolaos Apostolikas, Efstratios Gavves, Cees G. M. Snoek, Yuki M. Asano
TL;DR
This work addresses semantic novelty detection in multi-object scenes by redefining the training normal as the predominant object. It introduces DEFEND, a dense feature fine-tuning stage that enforces object-level consistency across patches, and a guided masked knowledge distillation framework that trains a student from partial inputs using the teacher's attention guidance. Together, these components achieve state-of-the-art performance on multi-object benchmarks like Pascal VOC and COCO, while maintaining competitive results on single-object datasets and offering substantial inference efficiency gains. The approach emphasizes object-centric representations to bridge the gap between real-world, cluttered images and prior object-centric methods, providing a scalable solution for practical anomaly detection.
Abstract
In the realm of novelty detection, accurately identifying outliers in data without specific class information poses a significant challenge. While current methods excel in single-object scenarios, they struggle with multi-object situations due to their focus on individual objects. Our paper suggests a novel approach: redefining `normal' at the object level in training datasets. Rather than the usual image-level view, we consider the most dominant object in a dataset as the norm, offering a perspective that is more effective for real-world scenarios. Adapting to our object-level definition of `normal', we modify knowledge distillation frameworks, where a student network learns from a pre-trained teacher network. Our first contribution, DeFeND(Dense Feature Fine-tuning on Normal Data), integrates dense feature fine-tuning into the distillation process, allowing the teacher network to focus on object-level features with a self-supervised loss. The second is masked knowledge distillation, where the student network works with partially hidden inputs, honing its ability to deduce and generalize from incomplete data. This approach not only fares well in single-object novelty detection but also considerably surpasses existing methods in multi-object contexts. The implementation is available at: https://github.com/SMSD75/Redefining_Normal_ACCV24/tree/main
