Exploiting Features and Logits in Heterogeneous Federated Learning

Yun-Hin Chan; Edith C. -H. Ngai

Exploiting Features and Logits in Heterogeneous Federated Learning

Yun-Hin Chan, Edith C. -H. Ngai

TL;DR

This work tackles federated learning when clients have heterogeneous neural architectures and privacy constraints by eliminating the need for a public dataset. It introduces Felo, which exchanges per-class mid-level features and logits to guide local updates, and Velo, which augments this with a server-side CVAE to model latent relationships and generate synthetic features. Empirical results on CIFAR-10 and CINIC-10 show that Felo and especially Velo outperform state-of-the-art heterogeneous FL baselines and can even surpass FedAvg in homogeneous settings, demonstrating robustness to data non-IID and extreme model heterogeneity. The methods offer a practical approach for edge AI in IoT, enabling effective training across diverse devices while preserving privacy, with promising directions for parallelizing server and client computations.

Abstract

Due to the rapid growth of IoT and artificial intelligence, deploying neural networks on IoT devices is becoming increasingly crucial for edge intelligence. Federated learning (FL) facilitates the management of edge devices to collaboratively train a shared model while maintaining training data local and private. However, a general assumption in FL is that all edge devices are trained on the same machine learning model, which may be impractical considering diverse device capabilities. For instance, less capable devices may slow down the updating process because they struggle to handle large models appropriate for ordinary devices. In this paper, we propose a novel data-free FL method that supports heterogeneous client models by managing features and logits, called Felo; and its extension with a conditional VAE deployed in the server, called Velo. Felo averages the mid-level features and logits from the clients at the server based on their class labels to provide the average features and logits, which are utilized for further training the client models. Unlike Felo, the server has a conditional VAE in Velo, which is used for training mid-level features and generating synthetic features according to the labels. The clients optimize their models based on the synthetic features and the average logits. We conduct experiments on two datasets and show satisfactory performances of our methods compared with the state-of-the-art methods.

Exploiting Features and Logits in Heterogeneous Federated Learning

TL;DR

Abstract

Paper Structure (16 sections, 2 equations, 5 figures, 4 tables, 2 algorithms)

This paper contains 16 sections, 2 equations, 5 figures, 4 tables, 2 algorithms.

Introduction
Problem Formulation
Our algorithms
Felo
Initial training
Average mid-level features and logits
Training on logits and features
Velo
Training mid-level features with CVAE
Experiments
Training on iid datasets
Training on non-iid datasets
Compared to FedAvg
Experiments on extreme heterogeneous models
Experiments on cVAE training interval.
...and 1 more sections

Figures (5)

Figure 1: The problem illustration of system heterogeneity. These clients are the participants in the federated learning process. The client models of participants are different because of their various available resources. Therefore, the cloud server utilizes shared knowledge from extracted features and logits instead of model weights to update the client models.
Figure 2: The architecture of the client models. A client model is divided into two parts, namely the feature extractor and the classifier. The network architectures of these two parts are not restricted in our design.
Figure 3: The process of handling logits in Felo and Velo is shown in \ref{['fig_logits']}, and the process of handling mid-level features in Velo is shown in \ref{['fig_velo']}.
Figure 4: Model accuracy of iid and non-iid data in CIFAR-10 and CINIC-10.
Figure 5: Convergence analyses of iid and non-iid data in CIFAR-10 and CINIC-10.

Exploiting Features and Logits in Heterogeneous Federated Learning

TL;DR

Abstract

Exploiting Features and Logits in Heterogeneous Federated Learning

Authors

TL;DR

Abstract

Table of Contents

Figures (5)