Federated Learning Can Find Friends That Are Advantageous
Nazarii Tupitsa, Samuel Horváth, Martin Takáč, Eduard Gorbunov
TL;DR
MeritFed introduces adaptive, merit-based aggregation in Federated Learning to exploit beneficial cross-client collaborations while suppressing detrimental ones. By solving a local auxiliary problem that optimizes the aggregation weights over the probability simplex, the server can weight gradients from all clients, with convergence guarantees no worse than those of aggregating only clients that share the target data distribution, and with improved empirical performance in heterogeneous settings. Two practical approaches to the auxiliary problem are given: one uses fresh data with zeroth-order Mirror Descent, and the other uses an extra validation dataset. The theoretical guarantees hold under standard smoothness and variance assumptions, and the method is validated empirically on mean estimation, image, and text tasks. The work highlights the practical impact of judicious client selection in FL and lays groundwork for scalable, robust, and efficient collaborative learning across diverse data distributions.
Abstract
In Federated Learning (FL), the distributed nature and heterogeneity of client data present both opportunities and challenges. While collaboration among clients can significantly enhance the learning process, not all collaborations are beneficial; some may even be detrimental. In this study, we introduce a novel algorithm that assigns adaptive aggregation weights to clients participating in FL training, identifying those with data distributions most conducive to a specific learning objective. We demonstrate that our aggregation method converges no worse than the method that aggregates only the updates received from clients with the same data distribution. Furthermore, empirical evaluations consistently reveal that collaborations guided by our algorithm outperform traditional FL approaches. This underscores the critical role of judicious client selection and lays the foundation for more streamlined and effective FL implementations in the coming years.
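The weighting idea summarized above can be illustrated on a toy mean-estimation problem. The following is a minimal sketch under stated assumptions, not the authors' implementation: the function names (`mirror_descent_weights`, `run_demo`), the step sizes, the number of clients, and the Gaussian data are all hypothetical choices made for illustration. It also swaps in a first-order entropic Mirror Descent (exponentiated-gradient) update for the auxiliary problem, rather than the zeroth-order variant mentioned in the TL;DR, because the gradient with respect to the weights is available in closed form for this quadratic loss.

```python
import numpy as np


def mirror_descent_weights(client_grads, target_grad_fn, x, lr, steps=50, md_step=0.5):
    """Approximately minimize the target loss at x - lr * sum_i w_i g_i over the simplex.

    client_grads: array of shape (n_clients, dim), one stochastic gradient per client.
    target_grad_fn: callable returning an estimate of the target-loss gradient at a point.
    """
    n_clients = client_grads.shape[0]
    logits = np.zeros(n_clients)             # log-weights; zeros give uniform weights
    w = np.full(n_clients, 1.0 / n_clients)
    for _ in range(steps):
        x_next = x - lr * (w @ client_grads)
        # Chain rule: d/dw_i f(x - lr * sum_j w_j g_j) = -lr * <g_i, grad f(x_next)>
        grad_w = -lr * (client_grads @ target_grad_fn(x_next))
        # Entropic mirror descent step, written as a softmax over accumulated logits
        logits -= md_step * grad_w
        shifted = np.exp(logits - logits.max())  # subtract max for numerical stability
        w = shifted / shifted.sum()
    return w


def run_demo(seed=0, rounds=200, dim=5, lr=0.1):
    """Compare merit-weighted and uniform aggregation on toy mean estimation."""
    rng = np.random.default_rng(seed)
    target_mean = np.ones(dim)
    # Assumed setup: clients 0-2 share the target distribution, clients 3-5 are shifted.
    means = np.vstack([np.tile(target_mean, (3, 1)),
                       np.tile(target_mean + 5.0, (3, 1))])
    x_merit = np.zeros(dim)
    x_uniform = np.zeros(dim)
    w = np.full(len(means), 1.0 / len(means))
    for _ in range(rounds):
        samples = means + rng.normal(size=means.shape)
        # Fresh target-client samples used only to solve the auxiliary problem
        fresh_mean = (target_mean + rng.normal(size=(32, dim))).mean(axis=0)
        grads = x_merit - samples  # stochastic gradients of 0.5 * ||x - sample||^2
        w = mirror_descent_weights(grads, lambda z: z - fresh_mean, x_merit, lr)
        x_merit = x_merit - lr * (w @ grads)
        x_uniform = x_uniform - lr * (x_uniform - samples).mean(axis=0)
    err_merit = np.linalg.norm(x_merit - target_mean)
    err_uniform = np.linalg.norm(x_uniform - target_mean)
    return w, err_merit, err_uniform


if __name__ == "__main__":
    weights, err_merit, err_uniform = run_demo()
    print("final weights:", np.round(weights, 3))
    print("error (merit-weighted):", err_merit)
    print("error (uniform average):", err_uniform)
```

In this toy setup, uniform averaging is pulled toward the mean of all six clients, while the adaptive weights can shrink the contribution of the shifted clients once their gradients stop helping the target objective. Restarting the weights from uniform every round is a simplification; warm-starting from the previous round's weights would be a natural alternative.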
