Diffusion-based Adversarial Purification for Intrusion Detection
Mohamed Amine Merzouk, Erwan Beurier, Reda Yaich, Nora Boulahia-Cuppens, Frédéric Cuppens
TL;DR
This work addresses adversarial vulnerability in ML-based intrusion detection by applying diffusion-based adversarial purification upstream of the IDS. The authors analyze how diffusion parameters—especially the optimal diffusion step $t^*$, the variance schedule $\beta$, and the number of diffusion steps $T$—interact with adversarial perturbations (magnitude $\epsilon$) across five attack methods on two datasets, UNSW-NB15 and NSL-KDD. They demonstrate that diffusion purification can restore high adversarial accuracy (up to ~80%) while preserving most of the clean data performance, with findings that the optimal noise level aligns with the perturbation magnitude and that larger $T$ improves robustness at the cost of latency. The paper provides practical guidance on parameter choices and highlights the need for combining diffusion-based purification with other defenses for robust, real-world IDS deployment. $t^*$, $\beta$, and $\sigma^2$ emerge as key quantities governing purification effectiveness, offering a pathway to principled defense design against evolving adversarial threats in network security.
Abstract
The escalating sophistication of cyberattacks has encouraged the integration of machine learning techniques in intrusion detection systems, but the rise of adversarial examples presents a significant challenge. These crafted perturbations mislead ML models, enabling attackers to evade detection or trigger false alerts. As a reaction, adversarial purification has emerged as a compelling solution, particularly with diffusion models showing promising results. However, their purification potential remains unexplored in the context of intrusion detection. This paper demonstrates the effectiveness of diffusion models in purifying adversarial examples in network intrusion detection. Through a comprehensive analysis of the diffusion parameters, we identify optimal configurations maximizing adversarial robustness with minimal impact on normal performance. Importantly, this study reveals insights into the relationship between diffusion noise and diffusion steps, representing a novel contribution to the field. Our experiments are carried out on two datasets and against 5 adversarial attacks. The implementation code is publicly available.
