Carbon-Aware Intrusion Detection: A Comparative Study of Supervised and Unsupervised DRL for Sustainable IoT Edge Gateways
Saeid Jamshidi, Foutse Khomh, Kawser Wazed Nafi, Amin Nikanjam, Samira Keivanpour, Omar Abdul-Wahab, Martine Bellaiche
TL;DR
This work tackles DDoS intrusion detection at IoT edge gateways with two DRL-based architectures designed for sustainability. DeepEdgeIDS offers unsupervised adaptability via an AE–DRL hybrid, while AutoDRL-IDS provides supervised, temporally aware detection through an LSTM–DRL fusion, both incorporating a carbon-aware reward to balance detection performance, latency, and environmental impact. The authors demonstrate theoretical guarantees (convergence, stability, Pareto optimality) and validate performance on a real-edge testbed with Bot-IoT data, showing DeepEdgeIDS achieving higher detection accuracy and faster adaptation, and AutoDRL-IDS delivering more energy- and carbon-efficient operation. The study highlights a practical trade-off between adaptability and efficiency, establishing a carbon-aware DRL framework as a viable path toward sustainable, real-time IoT security at the edge.
Abstract
The rapid expansion of the Internet of Things (IoT) has intensified cybersecurity challenges, particularly in mitigating Distributed Denial-of-Service (DDoS) attacks at the network edge. Traditional Intrusion Detection Systems (IDSs) face significant limitations, including poor adaptability to evolving and zero-day attacks, reliance on static signatures and labeled datasets, and inefficiency on resource-constrained edge gateways. Moreover, most existing DRL-based IDS studies overlook sustainability factors such as energy efficiency and carbon impact. To address these challenges, this paper proposes two novel Deep Reinforcement Learning (DRL)-based IDS: DeepEdgeIDS, an unsupervised Autoencoder-DRL hybrid, and AutoDRL-IDS, a supervised LSTM-DRL model. Both DRL-based IDS are validated through theoretical analysis and experimental evaluation on edge gateways. Results demonstrate that AutoDRL-IDS achieves 94% detection accuracy using labeled data, while DeepEdgeIDS attains 98% accuracy and adaptability without labels. Distinctly, this study introduces a carbon-aware, multi-objective reward function optimized for sustainable and real-time IDS operations in dynamic IoT networks.
