A Crowdsensing Intrusion Detection Dataset For Decentralized Federated Learning Models

Chao Feng; Alberto Huertas Celdran; Jing Han; Heqing Ren; Xi Cheng; Zien Zeng; Lucas Krauter; Gerome Bovet; Burkhard Stiller

Abstract

This paper introduces a dataset and an experimental study on Decentralized Federated Learning (DFL) for malware detection in Internet of Things (IoT) crowdsensing. The dataset comprises behavioral records of benign activity and of eight malware attacks. A total of 21,582,484 raw records were collected from six sources: system calls, file system activities, resource usage, kernel events, input/output events, and network records. These records were aggregated into 30-second windows, yielding 342,106 data records for model training and evaluation. Experiments on a DFL platform compare traditional Machine Learning (ML), Centralized Federated Learning (CFL), and DFL across different node counts, topologies, and data distributions. The results show that DFL remains competitive while preserving data locality and outperforms CFL in most settings. The dataset provides a foundation for studying the security of IoT crowdsensing environments.
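
The 30-second aggregation step can be illustrated with a minimal sketch. The abstract does not specify which features are computed per window, so the per-source event counts below are an assumption made purely for illustration, as are the function name `aggregate_windows`, the `(timestamp, source)` record layout, and the sample values.

```python
from collections import defaultdict

WINDOW_SECONDS = 30  # window length stated in the abstract


def aggregate_windows(records, window_seconds=WINDOW_SECONDS):
    """Group (timestamp_seconds, source) records into fixed-length windows.

    Returns a dict mapping each window start time to per-source event counts.
    Counting events per source is an illustrative assumption, not the
    dataset's documented feature set.
    """
    windows = defaultdict(lambda: defaultdict(int))
    for timestamp, source in records:
        # Floor the timestamp to the start of its window.
        window_start = int(timestamp // window_seconds) * window_seconds
        windows[window_start][source] += 1
    return {start: dict(counts) for start, counts in sorted(windows.items())}


# Hypothetical raw records: (seconds since start, monitoring source).
raw = [
    (0.5, "syscall"),
    (12.0, "network"),
    (29.9, "syscall"),
    (30.0, "syscall"),
    (61.2, "file_system"),
]
features = aggregate_windows(raw)
print(features)
```

Here five raw records collapse into three windowed records, which mirrors on a small scale how many raw events reduce to far fewer training samples.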
