CableInspect-AD: An Expert-Annotated Anomaly Detection Dataset
Akshatha Arodi, Margaux Luck, Jean-Luc Bedwani, Aldo Zaimi, Ge Li, Nicolas Pouliot, Julien Beaudry, Gaétan Marceau Caron
TL;DR
CableInspect-AD tackles the gap in transferring visual anomaly detection to high-stakes industrial contexts by introducing a richly annotated, real-world dataset for robotic power-line cable inspection. The authors propose Enhanced-PatchCore, a threshold-estimation method that uses only training nominal images, and systematically evaluate it alongside open conversational Vision-Language Models in zero-shot settings through a robust cross-validation protocol. Results show strong zero-shot capabilities for VLMs but uneven performance across anomaly types and grades, while Enhanced-PatchCore remains competitive, especially for pixel-level segmentation and few-/many-shot settings. The dataset and evaluation framework establish a challenging, open benchmark to drive robust VAD in real-world industrial applications and encourage future work on zero-shot detection, localization, and segmentation in specialized domains.
Abstract
Machine learning models are increasingly being deployed in real-world contexts. However, systematic studies on their transferability to specific and critical applications are underrepresented in the research literature. An important example is visual anomaly detection (VAD) for robotic power line inspection. While existing VAD methods perform well in controlled environments, real-world scenarios present diverse and unexpected anomalies that current datasets fail to capture. To address this gap, we introduce $\textit{CableInspect-AD}$, a high-quality, publicly available dataset created and annotated by domain experts from Hydro-Québec, a Canadian public utility. This dataset includes high-resolution images with challenging real-world anomalies, covering defects with varying severity levels. To address the challenges of collecting diverse anomalous and nominal examples for setting a detection threshold, we propose an enhancement to the celebrated PatchCore algorithm. This enhancement enables its use in scenarios with limited labeled data. We also present a comprehensive evaluation protocol based on cross-validation to assess models' performances. We evaluate our $\textit{Enhanced-PatchCore}$ for few-shot and many-shot detection, and Vision-Language Models for zero-shot detection. While promising, these models struggle to detect all anomalies, highlighting the dataset's value as a challenging benchmark for the broader research community. Project page: https://mila-iqia.github.io/cableinspect-ad/.
