Local Density-Based Anomaly Score Normalization for Domain Generalization
Kevin Wilkinghoff, Haici Yang, Janek Ebbers, François G. Germain, Gordon Wichern, Jonathan Le Roux
TL;DR
This paper tackles domain generalization in anomalous sound detection by addressing the domain mismatch in anomaly-score distributions across source and target domains. It introduces a local-density-based anomaly-score normalization with two variants, $A^{K-NN}_{scaled}$ and $A^{GWRP}_{scaled}$, which reweight scores using local reference densities defined through $K$-nearest neighbors or a weighted density with parameter $r$. Across five DCASE ASD datasets and multiple embeddings, the normalization consistently improves target-domain performance and often outperforms existing normalization strategies without requiring domain labels or domain-specific training. The proposed method enables robust, single-threshold ASD under diverse domain shifts and demonstrates strong potential for real-world deployment, including ensemble gains that achieve competitive or state-of-the-art results on several benchmarks.
Abstract
State-of-the-art anomalous sound detection (ASD) systems in domain-shifted conditions rely on projecting audio signals into an embedding space and using distance-based outlier detection to compute anomaly scores. One of the major difficulties to overcome is the so-called domain mismatch between the anomaly score distributions of a source domain and a target domain that differ acoustically and in terms of the amount of training data provided. A decision threshold that is optimal for one domain may be highly sub-optimal for the other domain and vice versa. This significantly degrades the performance when only using a single decision threshold, as is required when generalizing to multiple data domains that are possibly unseen during training while still using the same trained ASD system as in the source domain. To reduce this mismatch between the domains, we propose a simple local-density-based anomaly score normalization scheme. In experiments conducted on several ASD datasets, we show that the proposed normalization scheme consistently improves performance for various types of embedding-based ASD systems and yields better results than existing anomaly score normalization approaches.
