Trustworthy Intrusion Detection: Confidence Estimation Using Latent Space

Ioannis Pitsiorlas; George Arvanitakis; Marios Kountouris

Trustworthy Intrusion Detection: Confidence Estimation Using Latent Space

Ioannis Pitsiorlas, George Arvanitakis, Marios Kountouris

TL;DR

This work addresses the reliability of anomaly detection in Intrusion Detection Systems by introducing a confidence metric derived from a Variational Autoencoder's latent space. The approach uses Mahalanobis distance in the latent space to quantify how representative an unknown sample is of the training data, thereby estimating the expected error $\hat{e}$ and producing a confidence score $\mathcal{C}$ that correlates with $\hat{e}$ (reported around $r=0.45$). The method is validated on the NSL-KDD dataset for a binary normal-vs-malicious task, with careful tuning of latent dimensionality and KL weight $\beta$, and is compared against Choquet–Mahalanobis and naïve distances. The results show that latent-space confidence can reduce false positives and improve trust in IDS outputs, with practical implications for real-time secure monitoring, while also outlining avenues for broader datasets and multi-class extensions.

Abstract

This work introduces a novel method for enhancing confidence in anomaly detection in Intrusion Detection Systems (IDS) through the use of a Variational Autoencoder (VAE) architecture. By developing a confidence metric derived from latent space representations, we aim to improve the reliability of IDS predictions against cyberattacks. Applied to the NSL-KDD dataset, our approach focuses on binary classification tasks to effectively distinguish between normal and malicious network activities. The methodology demonstrates a significant enhancement in anomaly detection, evidenced by a notable correlation of 0.45 between the reconstruction error and the proposed metric. Our findings highlight the potential of employing VAEs for more accurate and trustworthy anomaly detection in network security.

Trustworthy Intrusion Detection: Confidence Estimation Using Latent Space

TL;DR

and producing a confidence score

that correlates with

(reported around

). The method is validated on the NSL-KDD dataset for a binary normal-vs-malicious task, with careful tuning of latent dimensionality and KL weight

, and is compared against Choquet–Mahalanobis and naïve distances. The results show that latent-space confidence can reduce false positives and improve trust in IDS outputs, with practical implications for real-time secure monitoring, while also outlining avenues for broader datasets and multi-class extensions.

Abstract

Paper Structure (21 sections, 11 equations, 5 figures, 3 tables)

This paper contains 21 sections, 11 equations, 5 figures, 3 tables.

Introduction
System Model
Optimization Problem
Anomaly Detection
Proposed Methodology
Prediction Error Calculation and Post Hoc Classification
Mahalanobis distance
Confidence Metric Calculation
Evaluation Metric
Experimental Results
The NSL-KDD Dataset
VAE Architecture and Training Curriculum
Adjusting Model Parameters
Latent Space Dimension
KL weight $\beta$
...and 6 more sections

Figures (5)

Figure 1: The main components of our VAE architecture
Figure 2: Train Set
Figure 3: Test Set
Figure 5: Weights for the KL loss and correlation
Figure 6: Confusion Matrix

Trustworthy Intrusion Detection: Confidence Estimation Using Latent Space

TL;DR

Abstract

Trustworthy Intrusion Detection: Confidence Estimation Using Latent Space

Authors

TL;DR

Abstract

Table of Contents

Figures (5)