An Investigation into the Performance of Non-Contrastive Self-Supervised Learning Methods for Network Intrusion Detection

Hamed Fard; Tobias Schalau; Gerhard Wunder

An Investigation into the Performance of Non-Contrastive Self-Supervised Learning Methods for Network Intrusion Detection

Hamed Fard, Tobias Schalau, Gerhard Wunder

TL;DR

This work tackles the challenge of limited labeled data in network intrusion detection by evaluating non-contrastive self-supervised learning (SSL) across multiple backbones and augmentation strategies. A two-stage, label-free pipeline learns normal-traffic representations with SSL (three encoders and six augmentations across five models) and detects anomalies via a K-means detector, evaluated on UNSW-NB15 and 5G-NIDD with 90 configurations. VICReg and Barlow Twins frequently yield top metrics, with Mixup (representation-space) and Gaussian Noise augmentation proving particularly effective on different datasets; however, autoencoder-based baselines can surpass non-contrastive SSL when properly tuned. The study highlights the critical roles of augmentation design and encoder choice in NIDS SSL, suggests that domain-specific augmentations and more advanced unsupervised detectors could further close the gap to reconstruction-based methods, and provides actionable insights for deploying label-efficient intrusion detection systems.

Abstract

Network intrusion detection, a well-explored cybersecurity field, has predominantly relied on supervised learning algorithms in the past two decades. However, their limitations in detecting only known anomalies prompt the exploration of alternative approaches. Motivated by the success of self-supervised learning in computer vision, there is a rising interest in adapting this paradigm for network intrusion detection. While prior research mainly delved into contrastive self-supervised methods, the efficacy of non-contrastive methods, in conjunction with encoder architectures serving as the representation learning backbone and augmentation strategies that determine what is learned, remains unclear for effective attack detection. This paper compares the performance of five non-contrastive self-supervised learning methods using three encoder architectures and six augmentation strategies. Ninety experiments are systematically conducted on two network intrusion detection datasets, UNSW-NB15 and 5G-NIDD. For each self-supervised model, the combination of encoder architecture and augmentation method yielding the highest average precision, recall, F1-score, and AUCROC is reported. Furthermore, by comparing the best-performing models to two unsupervised baselines, DeepSVDD, and an Autoencoder, we showcase the competitiveness of the non-contrastive methods for attack detection. Code at: https://github.com/renje4z335jh4/non_contrastive_SSL_NIDS

An Investigation into the Performance of Non-Contrastive Self-Supervised Learning Methods for Network Intrusion Detection

TL;DR

Abstract

An Investigation into the Performance of Non-Contrastive Self-Supervised Learning Methods for Network Intrusion Detection

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (3)