Table of Contents
Fetching ...

Device Scheduling and Assignment in Hierarchical Federated Learning for Internet of Things

Tinghao Zhang, Kwok-Yan Lam, Jun Zhao

TL;DR

An improved $K$ -center algorithm for device scheduling and a deep reinforcement learning-based approach for assigning IoT devices to edge servers are proposed and introduced.

Abstract

Federated Learning (FL) is a promising machine learning approach for Internet of Things (IoT), but it has to address network congestion problems when the population of IoT devices grows. Hierarchical FL (HFL) alleviates this issue by distributing model aggregation to multiple edge servers. Nevertheless, the challenge of communication overhead remains, especially in scenarios where all IoT devices simultaneously join the training process. For scalability, practical HFL schemes select a subset of IoT devices to participate in the training, hence the notion of device scheduling. In this setting, only selected IoT devices are scheduled to participate in the global training, with each of them being assigned to one edge server. Existing HFL assignment methods are primarily based on search mechanisms, which suffer from high latency in finding the optimal assignment. This paper proposes an improved K-Center algorithm for device scheduling and introduces a deep reinforcement learning-based approach for assigning IoT devices to edge servers. Experiments show that scheduling 50% of IoT devices is generally adequate for achieving convergence in HFL with much lower time delay and energy consumption. In cases where reduction in energy consumption (such as in Green AI) and reduction of messages (to avoid burst traffic) are key objectives, scheduling 30% IoT devices allows a substantial reduction in energy and messages with similar model accuracy.

Device Scheduling and Assignment in Hierarchical Federated Learning for Internet of Things

TL;DR

An improved -center algorithm for device scheduling and a deep reinforcement learning-based approach for assigning IoT devices to edge servers are proposed and introduced.

Abstract

Federated Learning (FL) is a promising machine learning approach for Internet of Things (IoT), but it has to address network congestion problems when the population of IoT devices grows. Hierarchical FL (HFL) alleviates this issue by distributing model aggregation to multiple edge servers. Nevertheless, the challenge of communication overhead remains, especially in scenarios where all IoT devices simultaneously join the training process. For scalability, practical HFL schemes select a subset of IoT devices to participate in the training, hence the notion of device scheduling. In this setting, only selected IoT devices are scheduled to participate in the global training, with each of them being assigned to one edge server. Existing HFL assignment methods are primarily based on search mechanisms, which suffer from high latency in finding the optimal assignment. This paper proposes an improved K-Center algorithm for device scheduling and introduces a deep reinforcement learning-based approach for assigning IoT devices to edge servers. Experiments show that scheduling 50% of IoT devices is generally adequate for achieving convergence in HFL with much lower time delay and energy consumption. In cases where reduction in energy consumption (such as in Green AI) and reduction of messages (to avoid burst traffic) are key objectives, scheduling 30% IoT devices allows a substantial reduction in energy and messages with similar model accuracy.
Paper Structure (20 sections, 28 equations, 13 figures, 2 tables, 6 algorithms)

This paper contains 20 sections, 28 equations, 13 figures, 2 tables, 6 algorithms.

Figures (13)

  • Figure 1: Overview of the proposed HFL framework. At the beginning of the $i$-th global iteration, the improved K-centers scheduling (IKC) method is carried out to form the set $\mathcal{H}_i$. Next, the scheduled devices are assigned to the corresponding edge servers via the DRL-based device assignment method. Then, each edge server performs resource allocation to determine the bandwidth and CPU frequency of the scheduled devices. Finally, the training process begins according to Algorithm \ref{['HFL_training']}.
  • Figure 2: Workflow of the BiLSTMs-based model at the $t$-th time slot. $\phi$ represents the parameters of the LSTM modules.
  • Figure 3: Testing accuracy of HFL on FashionMNIST with different size of $\mathcal{H}$.
  • Figure 4: Testing accuracy of HFL on CIFAR-10 with different size of $\mathcal{H}$.
  • Figure 5: Learning curve of the proposed D$^3$QN.
  • ...and 8 more figures