Table of Contents
Fetching ...

Towards Adaptive Asynchronous Federated Learning for Human Activity Recognition

Rastko Gajanin, Anastasiya Danilenka, Andrea Morichetta, Stefan Nastic

TL;DR

This work focuses on the combination of FL applied to Human Activity Recognition (HAR), where the task is to detect which kind of movements or actions individuals perform, and provides an open-source extension of the Flower framework that enables asynchronous FL.

Abstract

In this work, we tackle the problem of performing multi-label classification in the case of extremely heterogeneous data and with decentralized Machine Learning. Solving this issue is very important in IoT scenarios, where data coming from various sources, collected by heterogeneous devices, serve the learning of a distributed ML model through Federated Learning (FL). Specifically, we focus on the combination of FL applied to Human Activity Recognition HAR), where the task is to detect which kind of movements or actions individuals perform. In this case, transitioning from centralized learning (CL) to federated learning is non-trivial as HAR displays heterogeneity in action and devices, leading to significant skews in label and feature distributions. We address this scenario by presenting concrete solutions and tools for transitioning from centralized to FL for non-IID scenarios, outlining the main design decisions that need to be taken. Leveraging an open-sourced HAR dataset, we experimentally evaluate the effects that data augmentation, scaling, optimizer, learning rate, and batch size choices have on the performance of resulting machine learning models. Some of our main findings include using SGD-m as an optimizer, global feature scaling across clients, and persistent feature skew in the presence of heterogeneous HAR data. Finally, we provide an open-source extension of the Flower framework that enables asynchronous FL.

Towards Adaptive Asynchronous Federated Learning for Human Activity Recognition

TL;DR

This work focuses on the combination of FL applied to Human Activity Recognition (HAR), where the task is to detect which kind of movements or actions individuals perform, and provides an open-source extension of the Flower framework that enables asynchronous FL.

Abstract

In this work, we tackle the problem of performing multi-label classification in the case of extremely heterogeneous data and with decentralized Machine Learning. Solving this issue is very important in IoT scenarios, where data coming from various sources, collected by heterogeneous devices, serve the learning of a distributed ML model through Federated Learning (FL). Specifically, we focus on the combination of FL applied to Human Activity Recognition HAR), where the task is to detect which kind of movements or actions individuals perform. In this case, transitioning from centralized learning (CL) to federated learning is non-trivial as HAR displays heterogeneity in action and devices, leading to significant skews in label and feature distributions. We address this scenario by presenting concrete solutions and tools for transitioning from centralized to FL for non-IID scenarios, outlining the main design decisions that need to be taken. Leveraging an open-sourced HAR dataset, we experimentally evaluate the effects that data augmentation, scaling, optimizer, learning rate, and batch size choices have on the performance of resulting machine learning models. Some of our main findings include using SGD-m as an optimizer, global feature scaling across clients, and persistent feature skew in the presence of heterogeneous HAR data. Finally, we provide an open-source extension of the Flower framework that enables asynchronous FL.

Paper Structure

This paper contains 25 sections, 2 equations, 7 figures, 2 tables.

Figures (7)

  • Figure 1: Violin plot highlighting the data and label skew. We can see how certain activities are underrepresented, as well as, from the violin width, how each activity is not equally distributed across clients.
  • Figure 2: Label distribution with different data augmentation settings: none, base and balanced. Note that in the balanced setting running and cycling are still less represented globally.
  • Figure 3: Zoomed-in convergence plots of the evaluation metrics for different data augmentation schemes (DA) in asynchronous FL. Different line length is the result of fixing a number of average client updates rather than train time.
  • Figure 4: Zoomed-in convergence plots of the evaluation metrics for different feature standardization schemes (STD) in asynchronous FL.
  • Figure 5: Boxplots depicting features distributions of the running class samples for different clients (varied by color) [Sensor: accelerometer-magnitude]
  • ...and 2 more figures