Improving Anomalous Sound Detection through Pseudo-anomalous Set Selection and Pseudo-label Utilization under Unlabeled Conditions
Ibuki Kuroyanagi, Takuya Fujimura, Kazuya Takeda, Tomoki Toda
TL;DR
The paper tackles anomalous sound detection when both suitable similar-machine data and detailed operation labels are scarce. It introduces three synergistic components: (1) pseudo-anomalous set selection from external data to augment normal-like samples, (2) triplet-loss–driven pseudo-labeling for unlabeled data, and (3) iterative learning to refine external-data selection and pseudo-labels across cycles. In unlabeled settings, the approach yields average AUC gains exceeding $6.6$ points and narrows the gap to fully supervised methods, while in labeled settings external data can provide additional boosts. The work demonstrates a practical ASD workflow that scales to diverse industrial contexts with limited annotation, providing robust deployment guidance and insights into data-volume trade-offs and loss-function choices.
Abstract
This paper addresses performance degradation in anomalous sound detection (ASD) when neither sufficiently similar machine data nor operational state labels are available. We present an integrated pipeline that combines three complementary components derived from prior work and extends them to the unlabeled ASD setting. First, we adapt an anomaly score based selector to curate external audio data resembling the normal sounds of the target machine. Second, we utilize triplet learning to assign pseudo-labels to unlabeled data, enabling finer classification of operational sounds and detection of subtle anomalies. Third, we employ iterative training to refine both the pseudo-anomalous set selection and pseudo-label assignment, progressively improving detection accuracy. Experiments on the DCASE2022-2024 Task 2 datasets demonstrate that, in unlabeled settings, our approach achieves an average AUC increase of over 6.6 points compared to conventional methods. In labeled settings, incorporating external data from the pseudo-anomalous set further boosts performance. These results highlight the practicality and robustness of our methods in scenarios with scarce machine data and labels, facilitating ASD deployment across diverse industrial settings with minimal annotation effort.
