Defect Spectrum: A Granular Look of Large-Scale Defect Datasets with Rich Semantics
Shuai Yang, Zhifei Chen, Pengguang Chen, Xi Fang, Yixun Liang, Shu Liu, Yingcong Chen
TL;DR
The paper introduces Defect Spectrum, a large, semantically rich industrial defect dataset built on four benchmarks to provide precise, multi-class defect annotations and descriptive captions, addressing the lack of granularity in existing datasets. To overcome data scarcity, it proposes Defect-Gen, a two-stage diffusion-based generator that models global structure with a large receptive field and local detail with a small receptive field, using patch-level processing and a custom auxiliary tool, Defect-Click, to accelerate precise annotation. The authors demonstrate improved defect segmentation performance and substantial gains in downstream tasks when incorporating synthetic data, while also delivering a comprehensive set of visual and quantitative analyses across multiple datasets. This work advances practical defect inspection by enabling finer-grained analysis, richer semantics, and more robust model training in low-data industrial settings.
Abstract
Defect inspection is paramount within the closed-loop manufacturing system. However, existing datasets for defect inspection often lack precision and semantic granularity required for practical applications. In this paper, we introduce the Defect Spectrum, a comprehensive benchmark that offers precise, semantic-abundant, and large-scale annotations for a wide range of industrial defects. Building on four key industrial benchmarks, our dataset refines existing annotations and introduces rich semantic details, distinguishing multiple defect types within a single image. Furthermore, we introduce Defect-Gen, a two-stage diffusion-based generator designed to create high-quality and diverse defective images, even when working with limited datasets. The synthetic images generated by Defect-Gen significantly enhance the efficacy of defect inspection models. Overall, The Defect Spectrum dataset demonstrates its potential in defect inspection research, offering a solid platform for testing and refining advanced models.
