Table of Contents
Fetching ...

Optimizing Malware Detection in IoT Networks: Leveraging Resource-Aware Distributed Computing for Enhanced Security

Sreenitha Kasarapu, Sanket Shukla, Sai Manoj Pudukotai Dinakarrao

TL;DR

The paper tackles malware detection in resource-constrained IoT networks by introducing a resource- and workload-aware distributed framework that partitions a CNN-based classifier across neighboring devices and uses a lightweight resource estimator to decide on-device versus offload. It integrates HPC-based microarchitectural traces converted to grayscale images, PCA for feature reduction, AllReduce-inspired distributed inference, and Downpour SGD for fault tolerance, achieving a $9.8×$ speed-up and an accuracy of $0.967$ on a mid-scale IoT testbed. The approach preserves data privacy by distributing task components rather than transferring complete classifiers, and it demonstrates practical viability for real-time defense in IoT environments. These results indicate substantial benefits for scalable, privacy-preserving malware detection in resource-limited IoT deployments and lay groundwork for broader adoption in edge computing scenarios.

Abstract

In recent years, networked IoT systems have revolutionized connectivity, portability, and functionality, offering a myriad of advantages. However, these systems are increasingly targeted by adversaries due to inherent security vulnerabilities and limited computational and storage resources. Malicious applications, commonly known as malware, pose a significant threat to IoT devices and networks. While numerous malware detection techniques have been proposed, existing approaches often overlook the resource constraints inherent in IoT environments, assuming abundant resources for detection tasks. This oversight is compounded by ongoing workloads such as sensing and on-device computations, further diminishing available resources for malware detection. To address these challenges, we present a novel resource- and workload-aware malware detection framework integrated with distributed computing for IoT networks. Our approach begins by analyzing available resources for malware detection using a lightweight regression model. Depending on resource availability, ongoing workload executions, and communication costs, the malware detection task is dynamically allocated either on-device or offloaded to neighboring IoT nodes with sufficient resources. To safeguard data integrity and user privacy, rather than transferring the entire malware detection task, the classifier is partitioned and distributed across multiple nodes, and subsequently integrated at the parent node for comprehensive malware detection. Experimental analysis demonstrates the efficacy of our proposed technique, achieving a remarkable speed-up of 9.8x compared to on-device inference, while maintaining a high malware detection accuracy of 96.7%.

Optimizing Malware Detection in IoT Networks: Leveraging Resource-Aware Distributed Computing for Enhanced Security

TL;DR

The paper tackles malware detection in resource-constrained IoT networks by introducing a resource- and workload-aware distributed framework that partitions a CNN-based classifier across neighboring devices and uses a lightweight resource estimator to decide on-device versus offload. It integrates HPC-based microarchitectural traces converted to grayscale images, PCA for feature reduction, AllReduce-inspired distributed inference, and Downpour SGD for fault tolerance, achieving a speed-up and an accuracy of on a mid-scale IoT testbed. The approach preserves data privacy by distributing task components rather than transferring complete classifiers, and it demonstrates practical viability for real-time defense in IoT environments. These results indicate substantial benefits for scalable, privacy-preserving malware detection in resource-limited IoT deployments and lay groundwork for broader adoption in edge computing scenarios.

Abstract

In recent years, networked IoT systems have revolutionized connectivity, portability, and functionality, offering a myriad of advantages. However, these systems are increasingly targeted by adversaries due to inherent security vulnerabilities and limited computational and storage resources. Malicious applications, commonly known as malware, pose a significant threat to IoT devices and networks. While numerous malware detection techniques have been proposed, existing approaches often overlook the resource constraints inherent in IoT environments, assuming abundant resources for detection tasks. This oversight is compounded by ongoing workloads such as sensing and on-device computations, further diminishing available resources for malware detection. To address these challenges, we present a novel resource- and workload-aware malware detection framework integrated with distributed computing for IoT networks. Our approach begins by analyzing available resources for malware detection using a lightweight regression model. Depending on resource availability, ongoing workload executions, and communication costs, the malware detection task is dynamically allocated either on-device or offloaded to neighboring IoT nodes with sufficient resources. To safeguard data integrity and user privacy, rather than transferring the entire malware detection task, the classifier is partitioned and distributed across multiple nodes, and subsequently integrated at the parent node for comprehensive malware detection. Experimental analysis demonstrates the efficacy of our proposed technique, achieving a remarkable speed-up of 9.8x compared to on-device inference, while maintaining a high malware detection accuracy of 96.7%.
Paper Structure (14 sections, 2 equations, 3 figures, 2 tables, 2 algorithms)

This paper contains 14 sections, 2 equations, 3 figures, 2 tables, 2 algorithms.

Figures (3)

  • Figure 1: (a) Distributed IoT Devices Framework, (b) HPC and Binary Data Pre-processing to Extract Input Image Dataset, (c) Framework to Identify the Resources in the Model
  • Figure 2: Latency of Distributed learning for Malware Detection
  • Figure 3: Resource Consumption for Inference Over n Nodes