Federated Learning over Connected Modes
Dennis Grinwald, Philipp Wiesner, Shinichi Nakajima
TL;DR
This work tackles statistical heterogeneity in federated learning by leveraging mode connectivity to learn a linearly connected low-loss solution simplex, enabling client personalization within a shared global structure. Floco assigns each client a subregion of the simplex based on gradient signals and jointly trains the endpoints to form a globally optimal simplex, improving both local and global performance with minimal overhead. An optional Floco+ extension adds Ditto-style local fine-tuning for further personalization. Across CIFAR-10 and FEMNIST benchmarks, Floco achieves superior accuracy and calibration, reduces worst-client performance gaps, and accelerates time-to-accuracy, demonstrating practical impact for cross-silo FL with heterogeneous data. The approach also offers insights into gradient variance reduction and scalable, communication-friendly personalization via a small, last-layer simplex mechanism.
Abstract
Statistical heterogeneity in federated learning poses two major challenges: slow global training due to conflicting gradient signals, and the need of personalization for local distributions. In this work, we tackle both challenges by leveraging recent advances in \emph{linear mode connectivity} -- identifying a linearly connected low-loss region in the parameter space of neural networks, which we call solution simplex. We propose federated learning over connected modes (\textsc{Floco}), where clients are assigned local subregions in this simplex based on their gradient signals, and together learn the shared global solution simplex. This allows personalization of the client models to fit their local distributions within the degrees of freedom in the solution simplex and homogenizes the update signals for the global simplex training. Our experiments show that \textsc{Floco} accelerates the global training process, and significantly improves the local accuracy with minimal computational overhead in cross-silo federated learning settings.
