Table of Contents
Fetching ...

Towards Unsupervised Validation of Anomaly-Detection Models

Lihi Idan

TL;DR

This work tackles unsupervised validation of anomaly-detection models, focusing on model selection and evaluation without labeled data. It introduces Accurately-Diverse ensembles that balance intra-ensemble agreement on general trends with intra-ensemble disagreement on exact rankings, operationalized through novel multi-way rank correlations and rank-cluster features. The Unsupervised Ensemble Divergence ($\mathcal{UED}$) score is proposed to compare a candidate model against the ensemble in a distance-based, weighted, unsupervised fashion. Empirical results on ten datasets show that Accurately-Diverse ensembles improve over the average unsupervised model and that $\mathcal{UED}$ correlates with supervised evaluation metrics, enabling effective unsupervised model selection and evaluation with practical impact for label-scarce regimes.

Abstract

Unsupervised validation of anomaly-detection models is a highly challenging task. While the common practices for model validation involve a labeled validation set, such validation sets cannot be constructed when the underlying datasets are unlabeled. The lack of robust and efficient unsupervised model-validation techniques presents an acute challenge in the implementation of automated anomaly-detection pipelines, especially when there exists no prior knowledge of the model's performance on similar datasets. This work presents a new paradigm to automated validation of anomaly-detection models, inspired by real-world, collaborative decision-making mechanisms. We focus on two commonly-used, unsupervised model-validation tasks -- model selection and model evaluation -- and provide extensive experimental results that demonstrate the accuracy and robustness of our approach on both tasks.

Towards Unsupervised Validation of Anomaly-Detection Models

TL;DR

This work tackles unsupervised validation of anomaly-detection models, focusing on model selection and evaluation without labeled data. It introduces Accurately-Diverse ensembles that balance intra-ensemble agreement on general trends with intra-ensemble disagreement on exact rankings, operationalized through novel multi-way rank correlations and rank-cluster features. The Unsupervised Ensemble Divergence () score is proposed to compare a candidate model against the ensemble in a distance-based, weighted, unsupervised fashion. Empirical results on ten datasets show that Accurately-Diverse ensembles improve over the average unsupervised model and that correlates with supervised evaluation metrics, enabling effective unsupervised model selection and evaluation with practical impact for label-scarce regimes.

Abstract

Unsupervised validation of anomaly-detection models is a highly challenging task. While the common practices for model validation involve a labeled validation set, such validation sets cannot be constructed when the underlying datasets are unlabeled. The lack of robust and efficient unsupervised model-validation techniques presents an acute challenge in the implementation of automated anomaly-detection pipelines, especially when there exists no prior knowledge of the model's performance on similar datasets. This work presents a new paradigm to automated validation of anomaly-detection models, inspired by real-world, collaborative decision-making mechanisms. We focus on two commonly-used, unsupervised model-validation tasks -- model selection and model evaluation -- and provide extensive experimental results that demonstrate the accuracy and robustness of our approach on both tasks.

Paper Structure

This paper contains 12 sections, 13 equations, 1 figure, 2 tables, 2 algorithms.

Figures (1)

  • Figure 1: Accurately Diverse Ensembles: System Diagram

Theorems & Definitions (2)

  • Definition 1
  • Definition 2