SpreadFGL: Edge-Client Collaborative Federated Graph Learning with Adaptive Neighbor Generation
Luying Zhong, Yueyang Pi, Zheyi Chen, Zhengxin Yu, Wang Miao, Xing Chen, Geyong Min
TL;DR
SpreadFGL tackles missing inter-client topology and edge-server overload in Federated Graph Learning by introducing an adaptive graph imputation generator and a versatile assessor to uncover generalized cross-subgraph links without exposing raw data. It defines FedGL as a centralized baseline and extends it to a multi-edge SpreadFGL with distributed training, where edge servers share globally inferred topology and perform joint model updates. The framework leverages an autoencoder-driven latent representation, negative sampling, and graph fixing to repair cross-subgraph connections, improving feature propagation and downstream accuracy. Empirical results on real-world testbeds and benchmarks show SpreadFGL achieving higher accuracy and faster convergence than state-of-the-art FGL methods, with ablations confirming the synergy of its components.
Abstract
Federated Graph Learning (FGL) has garnered widespread attention by enabling collaborative training on multiple clients for semi-supervised classification tasks. However, most existing FGL studies do not well consider the missing inter-client topology information in real-world scenarios, causing insufficient feature aggregation of multi-hop neighbor clients during model training. Moreover, the classic FGL commonly adopts the FedAvg but neglects the high training costs when the number of clients expands, resulting in the overload of a single edge server. To address these important challenges, we propose a novel FGL framework, named SpreadFGL, to promote the information flow in edge-client collaboration and extract more generalized potential relationships between clients. In SpreadFGL, an adaptive graph imputation generator incorporated with a versatile assessor is first designed to exploit the potential links between subgraphs, without sharing raw data. Next, a new negative sampling mechanism is developed to make SpreadFGL concentrate on more refined information in downstream tasks. To facilitate load balancing at the edge layer, SpreadFGL follows a distributed training manner that enables fast model convergence. Using real-world testbed and benchmark graph datasets, extensive experiments demonstrate the effectiveness of the proposed SpreadFGL. The results show that SpreadFGL achieves higher accuracy and faster convergence against state-of-the-art algorithms.
