Initialisation and Network Effects in Decentralised Federated Learning
Arash Badie-Modiri, Chiara Boldrini, Lorenzo Valerio, János Kertész, Márton Karsai
TL;DR
The paper tackles the challenge that fully decentralised federated learning performance hinges on network topology and initial parameter conditions. It proposes a topology-aware uncoordinated initialisation based on the distribution of eigenvector centralities, corrected by the steady-state norm $||v_{steady}||$, to prevent detrimental parameter compression from repeated neighbourhood averaging. A simple numerical model and Markov-chain analysis link early dynamics to gossip-like aggregation and mixing times, yielding practical formulas and exact/approximate gain-correction strategies that make decentralised training approach the efficiency of a centralised baseline given the same total data. Empirical validation across multiple datasets and architectures demonstrates robustness to topology and partial connectivity, with insights into how network density, data per node, system size, and communication frequency influence scalability. The work provides design guidance for scalable, uncoordinated distributed training and highlights the critical role of network structure in learning dynamics, while acknowledging limitations and avenues for future extension.
Abstract
Fully decentralised federated learning enables collaborative training of individual machine learning models on a distributed network of communicating devices while keeping the training data localised on each node. This approach avoids central coordination, enhances data privacy and eliminates the risk of a single point of failure. Our research highlights that the effectiveness of decentralised federated learning is significantly influenced by the network topology of connected devices and the learning models' initial conditions. We propose a strategy for uncoordinated initialisation of the artificial neural networks based on the distribution of eigenvector centralities of the underlying communication network, leading to a radically improved training efficiency. Additionally, our study explores the scaling behaviour and the choice of environmental parameters under our proposed initialisation strategy. This work paves the way for more efficient and scalable artificial neural network training in a distributed and uncoordinated environment, offering a deeper understanding of the intertwining roles of network structure and learning dynamics.
