FedVCK: Non-IID Robust and Communication-Efficient Federated Learning via Valuable Condensed Knowledge for Medical Image Analysis
Guochen Yan, Luyuan Xie, Xinyi Gao, Wentao Zhang, Qingni Shen, Yuejian Fang, Zhonghai Wu
TL;DR
FedVCK tackles non-IID data in federated medical image analysis by condensing high-quality, necessary knowledge from each client into a small dataset using latent distribution constraints, and by updating the global model through relational prototype-wise contrastive learning at the server. The method combines distribution-matching condensation, latent-statistics fixing, and model-guided, importance-weighted data selection to minimize repeated knowledge and communication, while preserving privacy via random initialization of the condensed data. Empirical results across medical and natural datasets show FedVCK achieving state-of-the-art performance under non-IID and limited-communication regimes, with robust privacy properties. The approach offers a practical, scalable path for cross-institutional medical image analysis with strong non-IID robustness and communication efficiency.
Abstract
Federated learning has become a promising solution for collaboration among medical institutions. However, data owned by each institution would be highly heterogeneous and the distribution is always non-independent and identical distribution (non-IID), resulting in client drift and unsatisfactory performance. Despite existing federated learning methods attempting to solve the non-IID problems, they still show marginal advantages but rely on frequent communication which would incur high costs and privacy concerns. In this paper, we propose a novel federated learning method: \textbf{Fed}erated learning via \textbf{V}aluable \textbf{C}ondensed \textbf{K}nowledge (FedVCK). We enhance the quality of condensed knowledge and select the most necessary knowledge guided by models, to tackle the non-IID problem within limited communication budgets effectively. Specifically, on the client side, we condense the knowledge of each client into a small dataset and further enhance the condensation procedure with latent distribution constraints, facilitating the effective capture of high-quality knowledge. During each round, we specifically target and condense knowledge that has not been assimilated by the current model, thereby preventing unnecessary repetition of homogeneous knowledge and minimizing the frequency of communications required. On the server side, we propose relational supervised contrastive learning to provide more supervision signals to aid the global model updating. Comprehensive experiments across various medical tasks show that FedVCK can outperform state-of-the-art methods, demonstrating that it's non-IID robust and communication-efficient.
