Attention Augmented GNN RNN-Attention Models for Advanced Cybersecurity Intrusion Detection
Jayant Biradar, Smit Shah, Tanmay Naik
TL;DR
This paper introduces a novel hybrid intrusion detection framework that merges Graph Neural Networks for spatial network relationships, bidirectional LSTMs for temporal dynamics, and multi head attention for feature prioritization and interpretability. Tested on the UNSW-NB15 dataset, the GNN-RNN-Attention model achieves state of the art performance with an overall accuracy of 97.5% and an AUC-ROC of 0.991, while maintaining high recall to minimize undetected attacks. Ablation studies confirm the synergistic benefits of each component and attention driven interpretability. The approach offers practical deployment advantages with scalable computation and attention based insights to support security analysts in real time threat detection and response.
Abstract
In this paper, we propose a novel hybrid deep learning architecture that synergistically combines Graph Neural Networks (GNNs), Recurrent Neural Networks (RNNs), and multi-head attention mechanisms to significantly enhance cybersecurity intrusion detection capabilities. By leveraging the comprehensive UNSW-NB15 dataset containing diverse network traffic patterns, our approach effectively captures both spatial dependencies through graph structural relationships and temporal dynamics through sequential analysis of network events. The integrated attention mechanism provides dual benefits of improved model interpretability and enhanced feature selection, enabling cybersecurity analysts to focus computational resources on high-impact security events -- a critical requirement in modern real-time intrusion detection systems. Our extensive experimental evaluation demonstrates that the proposed hybrid model achieves superior performance compared to traditional machine learning approaches and standalone deep learning models across multiple evaluation metrics, including accuracy, precision, recall, and F1-score. The model achieves particularly strong performance in detecting sophisticated attack patterns such as Advanced Persistent Threats (APTs), Distributed Denial of Service (DDoS) attacks, and zero-day exploits, making it a promising solution for next-generation cybersecurity applications in complex network environments.
