Table of Contents
Fetching ...

Federated Learning Framework via Distributed Mutual Learning

Yash Gupta

TL;DR

The paper addresses privacy and bandwidth limitations in traditional federated learning by introducing a loss-based distributed mutual learning framework that exchanges loss predictions on a public test set instead of sharing model weights. It formalizes a loss objective $L = L_{model} + L_{KL}$ with $L_{KL} = \frac{1}{K-1}\sum_{j \neq i}^{K} P_i \log\frac{P_i}{P_j}$ to drive inter-client alignment across rounds. The proposed method is evaluated on a face-mask detection task using a VisionNet-like CNN and demonstrates higher unseen-data accuracy and better generalization than weight-sharing baselines, while preserving privacy and reducing bandwidth. The work suggests that distributed mutual learning can enable privacy-preserving, bandwidth-efficient federated learning in IoT contexts, with future directions including non-IID data handling and diverse architectures.

Abstract

Federated Learning often relies on sharing full or partial model weights, which can burden network bandwidth and raise privacy risks. We present a loss-based alternative using distributed mutual learning. Instead of transmitting weights, clients periodically share their loss predictions on a public test set. Each client then refines its model by combining its local loss with the average Kullback-Leibler divergence over losses from other clients. This collaborative approach both reduces transmission overhead and preserves data privacy. Experiments on a face mask detection task demonstrate that our method outperforms weight-sharing baselines, achieving higher accuracy on unseen data while providing stronger generalization and privacy benefits.

Federated Learning Framework via Distributed Mutual Learning

TL;DR

The paper addresses privacy and bandwidth limitations in traditional federated learning by introducing a loss-based distributed mutual learning framework that exchanges loss predictions on a public test set instead of sharing model weights. It formalizes a loss objective with to drive inter-client alignment across rounds. The proposed method is evaluated on a face-mask detection task using a VisionNet-like CNN and demonstrates higher unseen-data accuracy and better generalization than weight-sharing baselines, while preserving privacy and reducing bandwidth. The work suggests that distributed mutual learning can enable privacy-preserving, bandwidth-efficient federated learning in IoT contexts, with future directions including non-IID data handling and diverse architectures.

Abstract

Federated Learning often relies on sharing full or partial model weights, which can burden network bandwidth and raise privacy risks. We present a loss-based alternative using distributed mutual learning. Instead of transmitting weights, clients periodically share their loss predictions on a public test set. Each client then refines its model by combining its local loss with the average Kullback-Leibler divergence over losses from other clients. This collaborative approach both reduces transmission overhead and preserves data privacy. Experiments on a face mask detection task demonstrate that our method outperforms weight-sharing baselines, achieving higher accuracy on unseen data while providing stronger generalization and privacy benefits.

Paper Structure

This paper contains 14 sections, 2 equations, 4 figures, 2 tables, 1 algorithm.

Figures (4)

  • Figure 1: Proposed model architecture.
  • Figure 2: Proposed model architecture.
  • Figure 3: Client Performances on Testing Data
  • Figure 4: Training History