dFLMoE: Decentralized Federated Learning via Mixture of Experts for Medical Data Analysis

Luyuan Xie; Tianyu Luan; Wenyuan Cai; Guochen Yan; Zhaoyu Chen; Nan Xi; Yuejian Fang; Qingni Shen; Zhonghai Wu; Junsong Yuan

dFLMoE: Decentralized Federated Learning via Mixture of Experts for Medical Data Analysis

Luyuan Xie, Tianyu Luan, Wenyuan Cai, Guochen Yan, Zhaoyu Chen, Nan Xi, Yuejian Fang, Qingni Shen, Zhonghai Wu, Junsong Yuan

TL;DR

dFLMoE tackles the privacy-preserving collaboration challenge in medical data by removing centralized aggregation and server dependency. It introduces a decentralized framework where each client trains a local body and head, exchanges lightweight head models with peers, and employs a client-specific Mixture of Experts with a feature-space transform and cross-attention to fuse knowledge. The method supports both homogeneous and heterogeneous client models and demonstrates robustness to network disruptions while maintaining high performance on five non-IID medical tasks. These results highlight the practical potential of decentralized MoE-based fusion for privacy-preserving, robust medical analytics.

Abstract

Federated learning has wide applications in the medical field. It enables knowledge sharing among different healthcare institutes while protecting patients' privacy. However, existing federated learning systems are typically centralized, requiring clients to upload client-specific knowledge to a central server for aggregation. This centralized approach would integrate the knowledge from each client into a centralized server, and the knowledge would be already undermined during the centralized integration before it reaches back to each client. Besides, the centralized approach also creates a dependency on the central server, which may affect training stability if the server malfunctions or connections are unstable. To address these issues, we propose a decentralized federated learning framework named dFLMoE. In our framework, clients directly exchange lightweight head models with each other. After exchanging, each client treats both local and received head models as individual experts, and utilizes a client-specific Mixture of Experts (MoE) approach to make collective decisions. This design not only reduces the knowledge damage with client-specific aggregations but also removes the dependency on the central server to enhance the robustness of the framework. We validate our framework on multiple medical tasks, demonstrating that our method evidently outperforms state-of-the-art approaches under both model homogeneity and heterogeneity settings.

dFLMoE: Decentralized Federated Learning via Mixture of Experts for Medical Data Analysis

TL;DR

Abstract

dFLMoE: Decentralized Federated Learning via Mixture of Experts for Medical Data Analysis

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (5)