CSAGC-IDS: A Dual-Module Deep Learning Network Intrusion Detection Model for Complex and Imbalanced Data
Yifan Zeng
TL;DR
CSAGC-IDS tackles the challenge of network intrusion detection under high-dimensional and imbalanced traffic by pairing a data-balancing generator (SC-CGAN) with a cost-sensitive classifier (CSCA-CNN). SC-CGAN uses a self-attention-augmented conditional GAN to synthesize minority-class samples, while CSCA-CNN applies channel attention and a cost-sensitive loss to improve minority-class detection. On NSL-KDD, CSAGC-IDS achieves 91.09% accuracy and 92.04% F1 for binary classification and 84.55% accuracy and 84.52% F1 for five-class classification, with interpretability analyses via LIME and SHAP. This dual-module architecture advances intrusion detection by addressing data imbalance, high dimensionality, and the need for model transparency, offering a practical approach for robust security analytics.
Abstract
As computer networks proliferate, the gravity of network intrusions has escalated, emphasizing the criticality of network intrusion detection systems for safeguarding security. While deep learning models have exhibited promising results in intrusion detection, they face challenges in managing high-dimensional, complex traffic patterns and imbalanced data categories. This paper presents CSAGC-IDS, a network intrusion detection model based on deep learning techniques. CSAGC-IDS integrates SC-CGAN, a self-attention-enhanced convolutional conditional generative adversarial network that generates high-quality data to mitigate class imbalance. Furthermore, CSAGC-IDS integrates CSCA-CNN, a convolutional neural network enhanced through cost sensitive learning and channel attention mechanism, to extract features from complex traffic data for precise detection. Experiments conducted on the NSL-KDD dataset. CSAGC-IDS achieves an accuracy of 84.55% and an F1-score of 84.52% in five-class classification task, and an accuracy of 91.09% and an F1 score of 92.04% in binary classification task.Furthermore, this paper provides an interpretability analysis of the proposed model, using SHAP and LIME to explain the decision-making mechanisms of the model.
