Differentially Private and Federated Structure Learning in Bayesian Networks
Ghita Fassy El Fehri, Aurélien Bellet, Philippe Bastien
TL;DR
The paper tackles learning Bayesian network structures from distributed data under stringent privacy and communication constraints.It introduces Fed-Sparse-BNSL, a sparsity-driven federated approach, and its differentially private variant DP-Fed-Sparse-BNSL, both leveraging Proximal Greedy Coordinate Descent to maintain identifiability while exchanging only sparse edge updates.A careful DP-PGCD design using the exponential mechanism for coordinate selection and Gaussian gradient perturbation, analyzed under zCDP, yields strong privacy-utility trade-offs, especially in high dimensions.Extensive experiments on synthetic and real data show substantial communication reductions, competitive structure recovery, and effective participant-level personalization, with clear privacy-utility behavior as the budget changes.
Abstract
Learning the structure of a Bayesian network from decentralized data poses two major challenges: (i) ensuring rigorous privacy guarantees for participants, and (ii) avoiding communication costs that scale poorly with dimensionality. In this work, we introduce Fed-Sparse-BNSL, a novel federated method for learning linear Gaussian Bayesian network structures that addresses both challenges. By combining differential privacy with greedy updates that target only a few relevant edges per participant, Fed-Sparse-BNSL efficiently uses the privacy budget while keeping communication costs low. Our careful algorithmic design preserves model identifiability and enables accurate structure estimation. Experiments on synthetic and real datasets demonstrate that Fed-Sparse-BNSL achieves utility close to non-private baselines while offering substantially stronger privacy and communication efficiency.
