Serial-OE: Anomalous sound detection based on serial method with outlier exposure capable of using small amounts of anomalous data for training
Ibuki Kuroyanagi, Tomoki Hayashi, Kazuya Takeda, Tomoki Toda
TL;DR
Serial-OE tackles ASD by enabling training with small amounts of anomalous data through an outlier-exposure framework that couples a per-type DNN feature extractor with per-ID GMM detectors. By using normal and pseudo-anomalous data (and optionally real anomalous data), it achieves competitive or superior ASD performance on DCASE2020 Task2 while remaining robust to data contamination and capable of operating without machine IDs. The method leverages Mixup, pretraining on ImageNet, and a norm-based type loss to shape a discriminative yet generative feature space, with a novel aggregation strategy to capture both stationary and non-stationary anomalies. The results suggest practical benefits for real-world ASD systems, including improved performance with minimal anomalous-data and resilience under data imperfections, though further work is needed for domain shift and edge deployment.
Abstract
We introduce Serial-OE, a new approach to anomalous sound detection (ASD) that leverages small amounts of anomalous data to improve the performance. Conventional ASD methods rely primarily on the modeling of normal data, due to the cost of collecting anomalous data from various possible types of equipment breakdowns. Our method improves upon existing ASD systems by implementing an outlier exposure framework that utilizes normal and pseudo-anomalous data for training, with the capability to also use small amounts of real anomalous data. A comprehensive evaluation using the DCASE2020 Task2 dataset shows that our method outperforms state-of-the-art ASD models. We also investigate the impact on performance of using a small amount of anomalous data during training, of using data without machine ID information, and of using contaminated training data. Our experimental results reveal the potential of using a very limited amount of anomalous data during training to address the limitations of existing methods using only normal data for training due to the scarcity of anomalous data. This study contributes to the field by presenting a method that can be dynamically adapted to include anomalous data during the operational phase of an ASD system, paving the way for more accurate ASD.
