FedGES: A Federated Learning Approach for BN Structure Learning
Pablo Torrijos, José A. Gámez, José M. Puerta
TL;DR
The paper tackles privacy concerns in Bayesian Network structure learning by introducing FedGES, a federated learning framework that uses constrained GES locally at clients and structural fusion at a server to produce a global BN structure without sharing data or parameters. FedGES preserves the theoretical properties of GES under a distributed, structure-only communication model and employs controlled fusion strategies to promote consensus while limiting complexity. Through extensive experiments on 14 bnlearn BN benchmarks, FedGES demonstrates superior performance over several federated and non-federated baselines, particularly for large or high-dimensional networks and when data are partitioned among many clients. The work offers a practical, privacy-preserving approach to BN structure learning with scalable convergence, reproducible implementation, and clear avenues for future enhancements such as handling non-IID data and adding federated parameter learning.
Abstract
Bayesian Network (BN) structure learning traditionally centralizes data, raising privacy concerns when data is distributed across multiple entities. This research introduces Federated GES (FedGES), a novel Federated Learning approach tailored for BN structure learning in decentralized settings using the Greedy Equivalence Search (GES) algorithm. FedGES uniquely addresses privacy and security challenges by exchanging only evolving network structures, not parameters or data. It realizes collaborative model development, using structural fusion to combine the limited models generated by each client in successive iterations. A controlled structural fusion is also proposed to enhance client consensus when adding any edge. Experimental results on various BNs from {\sf bnlearn}'s BN Repository validate the effectiveness of FedGES, particularly in high-dimensional (a large number of variables) and sparse data scenarios, offering a practical and privacy-preserving solution for real-world BN structure learning.
