Adversarial Robustness of Distilled and Pruned Deep Learning-based Wireless Classifiers
Nayan Moni Baishya, B. R. Manoj
TL;DR
This paper tackles the vulnerability of DL-based automatic modulation classification (AMC) to adversarial perturbations in edge settings. It introduces two optimized models—distilled and distill-pruned—via knowledge distillation from a large teacher and Net-Trim pruning to achieve high sparsity with maintained accuracy. Adversarial robustness is pursued with a computationally efficient PGD-FGSM hybrid adversarial training strategy, evaluated against five white-box attacks on the RML2016.10A dataset, demonstrating improved robustness over the standard model while preserving clean-sample performance. The results indicate that combining distillation with pruning yields both sparsity and resilience, enabling practical, secure edge deployments of DL-based AMC; future work will explore retraining-free countermeasures for on-device protection.
Abstract
Data-driven deep learning (DL) techniques developed for automatic modulation classification (AMC) of wireless signals are vulnerable to adversarial attacks. This poses a severe security threat to the DL-based wireless systems, specifically for edge applications of AMC. In this work, we address the joint problem of developing optimized DL models that are also robust against adversarial attacks. This enables efficient and reliable deployment of DL-based AMC on edge devices. We first propose two optimized models using knowledge distillation and network pruning, followed by a computationally efficient adversarial training process to improve the robustness. Experimental results on five white-box attacks show that the proposed optimized and adversarially trained models can achieve better robustness than the standard (unoptimized) model. The two optimized models also achieve higher accuracy on clean (unattacked) samples, which is essential for the reliability of DL-based solutions at edge applications.
