SGFusion: Stochastic Geographic Gradient Fusion in Federated Learning
Khoa Nguyen, Khang Tran, NhatHai Phan, Cristian Borcea, Ruoming Jin, Issa Khalil
TL;DR
SGFusion tackles non-IID, geography-driven federated learning by modeling inter-zone data correlations with a hierarchical random graph (HRG) and optimizing it via Markov Chain Monte Carlo (MCMC). Each zone fuses gradients from a small, stochastically sampled set of other zones, guided by self-attention weights that reflect similarity, and DP-preserving zone histograms are used to construct the HRG. Theoretical guarantees show a convergence rate of $O({}{T})$ under standard convexity and Lipschitz conditions, with complexity dominated by one-time HRG construction and linear per-round updates across zones. Empirically, SGFusion yields significant zone- and country-level utility improvements on a heart-rate prediction dataset collected across six countries, while maintaining scalable computation and communication costs. The approach offers a practical path to scalable, personalized FL in mobile sensing with strong privacy considerations.
Abstract
This paper proposes Stochastic Geographic Gradient Fusion (SGFusion), a novel training algorithm to leverage the geographic information of mobile users in Federated Learning (FL). SGFusion maps the data collected by mobile devices onto geographical zones and trains one FL model per zone, which adapts well to the data and behaviors of users in that zone. SGFusion models the local data-based correlation among geographical zones as a hierarchical random graph (HRG) optimized by Markov Chain Monte Carlo sampling. At each training step, every zone fuses its local gradient with gradients derived from a small set of other zones sampled from the HRG. This approach enables knowledge fusion and sharing among geographical zones in a probabilistic and stochastic gradient fusion process with self-attention weights, such that "more similar" zones have "higher probabilities" of sharing gradients with "larger attention weights." SGFusion remarkably improves model utility without introducing undue computational cost. Extensive theoretical and empirical results using a heart-rate prediction dataset collected across 6 countries show that models trained with SGFusion converge with upper-bounded expected errors and significantly improve utility in all countries compared to existing approaches without notable cost in system scalability.
