Proximity-based Self-Federated Learning
Davide Domini, Gianluca Aguzzi, Nicolas Farabegoli, Mirko Viroli, Lukas Esterle
TL;DR
The paper tackles the scalability and data-heterogeneity challenges of traditional federated learning by proposing proximity-based self-federated learning (PSFL), a fully distributed framework that forms regionally coherent federations around leaders using geographic proximity and data similarity. PSFL leverages self-organising coordination region concepts and gradient-based dissemination within an aggregate computing context to dynamically create, train, and distribute federation-specific models without sharing raw data. Key contributions include a ds-based dissimilarity measure, a gradient-field guidance mechanism, and a four-step federation workflow (creation, collection, aggregation, distribution) implemented via space-fluid sparse choice; empirical results on Extended MNIST Letters show PSFL outperforms centralized FedAVG, especially in highly non-IID settings. The approach holds promise for edge and decentralized deployments by enabling adaptive, privacy-preserving, region-specific model learning at scale.
Abstract
In recent advancements in machine learning, federated learning allows a network of distributed clients to collaboratively develop a global model without needing to share their local data. This technique aims to safeguard privacy, countering the vulnerabilities of conventional centralized learning methods. Traditional federated learning approaches often rely on a central server to coordinate model training across clients, aiming to replicate the same model uniformly across all nodes. However, these methods overlook the significance of geographical and local data variances in vast networks, potentially affecting model effectiveness and applicability. Moreover, relying on a central server might become a bottleneck in large networks, such as the ones promoted by edge computing. Our paper introduces a novel, fully-distributed federated learning strategy called proximity-based self-federated learning that enables the self-organised creation of multiple federations of clients based on their geographic proximity and data distribution without exchanging raw data. Indeed, unlike traditional algorithms, our approach encourages clients to share and adjust their models with neighbouring nodes based on geographic proximity and model accuracy. This method not only addresses the limitations posed by diverse data distributions but also enhances the model's adaptability to different regional characteristics creating specialized models for each federation. We demonstrate the efficacy of our approach through simulations on well-known datasets, showcasing its effectiveness over the conventional centralized federated learning framework.
