Table of Contents
Fetching ...

UnifyFL: Enabling Decentralized Cross-Silo Federated Learning

Sarang S, Druva Dhakshinamoorthy, Aditya Shiva Sharma, Yuvraj Singh Bhadauria, Siddharth Chaitra Vivek, Arihant Bansal, Arnab K. Paul

TL;DR

UnifyFL tackles the challenge of enabling trustworthy and resource-efficient collaboration across diverse FL clusters by introducing a decentralized cross-silo FL framework. It combines a decentralized orchestrator on a private Ethereum network with IPFS-based distributed storage and a Flower-based training workflow, plus randomized scorers to audibly validate models, supporting both synchronous and asynchronous operation. The paper details the architecture, implementation, and a thorough evaluation on real testbeds, showing convergence and performance close to ideal multilevel FL with significantly reduced overhead. The work demonstrates practical potential for scalable, trustful inter-organizational FL and outlines future work on multi-model FL, Byzantine defense, and privacy-preserving enhancements.

Abstract

Federated Learning (FL) is a decentralized machine learning (ML) paradigm in which models are trained on private data across several devices called clients and combined at a single node called an aggregator rather than aggregating the data itself. Many organizations employ FL to have better privacy-aware ML-driven decision-making capabilities. However, organizations often operate independently rather than collaborate to enhance their FL capabilities due to the lack of an effective mechanism for collaboration. The challenge lies in balancing trust and resource efficiency. One approach relies on trusting a third-party aggregator to consolidate models from all organizations (multilevel FL), but this requires trusting an entity that may be biased or unreliable. Alternatively, organizations can bypass a third party by sharing their local models directly, which requires significant computational resources for validation. Both approaches reflect a fundamental trade-off between trust and resource constraints, with neither offering an ideal solution. In this work, we develop a trust-based cross-silo FL framework called UnifyFL, which uses decentralized orchestration and distributed storage. UnifyFL provides flexibility to the participating organizations and presents synchronous and asynchronous modes to handle stragglers. Our evaluation on a diverse testbed shows that UnifyFL achieves a performance comparable to the ideal multilevel centralized FL while allowing trust and optimal use of resources.

UnifyFL: Enabling Decentralized Cross-Silo Federated Learning

TL;DR

UnifyFL tackles the challenge of enabling trustworthy and resource-efficient collaboration across diverse FL clusters by introducing a decentralized cross-silo FL framework. It combines a decentralized orchestrator on a private Ethereum network with IPFS-based distributed storage and a Flower-based training workflow, plus randomized scorers to audibly validate models, supporting both synchronous and asynchronous operation. The paper details the architecture, implementation, and a thorough evaluation on real testbeds, showing convergence and performance close to ideal multilevel FL with significantly reduced overhead. The work demonstrates practical potential for scalable, trustful inter-organizational FL and outlines future work on multi-model FL, Byzantine defense, and privacy-preserving enhancements.

Abstract

Federated Learning (FL) is a decentralized machine learning (ML) paradigm in which models are trained on private data across several devices called clients and combined at a single node called an aggregator rather than aggregating the data itself. Many organizations employ FL to have better privacy-aware ML-driven decision-making capabilities. However, organizations often operate independently rather than collaborate to enhance their FL capabilities due to the lack of an effective mechanism for collaboration. The challenge lies in balancing trust and resource efficiency. One approach relies on trusting a third-party aggregator to consolidate models from all organizations (multilevel FL), but this requires trusting an entity that may be biased or unreliable. Alternatively, organizations can bypass a third party by sharing their local models directly, which requires significant computational resources for validation. Both approaches reflect a fundamental trade-off between trust and resource constraints, with neither offering an ideal solution. In this work, we develop a trust-based cross-silo FL framework called UnifyFL, which uses decentralized orchestration and distributed storage. UnifyFL provides flexibility to the participating organizations and presents synchronous and asynchronous modes to handle stragglers. Our evaluation on a diverse testbed shows that UnifyFL achieves a performance comparable to the ideal multilevel centralized FL while allowing trust and optimal use of resources.

Paper Structure

This paper contains 42 sections, 7 figures, 7 tables, 1 algorithm.

Figures (7)

  • Figure 1: An illustration of a collaborative FL architecture in the automobile industry, where vehicle fleets (FL clients) from different companies train models locally and share updates with their respective FL aggregators.
  • Figure 2: Approaches for collaborative FL.
  • Figure 3: Centralized (or Multilevel) aggregation vs Peer-to-Peer aggregation.
  • Figure 4: A step-by-step walk-through of UnifyFL Framework. (a) Clusters perform local training, aggregate the results, and submit local weight for scoring. (b) Scorers pull weights and submit their scores. (c) Aggregators pull and aggregate global models to send to clients for the next round.
  • Figure 5: Workflow of the Sync mode in UnifyFL with two aggregators.
  • ...and 2 more figures