Federated Neuro-Symbolic Learning
Pengwei Xing, Songtao Lu, Han Yu
TL;DR
This work extends neuro-symbolic learning to federated settings by modeling rule distributions as the communication medium between server and clients. It introduces a distribution-coupled bilevel optimization framework, solved with a tailored variational EM that jointly learns a global prior over rules and local posteriors, while enforcing a KL-divergence constraint to mitigate rule heterogeneity across domains. The approach leverages a transformer-based rule generator and a KG-aware E-step to efficiently search and score candidate rules, substantially reducing the rule-search space. Empirical results on synthetic and real-world data demonstrate notable gains in unbalanced training accuracy and unseen testing accuracy, validating FedNSL’s ability to perform personalized NSL under FL while preserving privacy and improving cross-domain generalization.
Abstract
Neuro-symbolic learning (NSL) models complex symbolic rule patterns into latent variable distributions by neural networks, which reduces rule search space and generates unseen rules to improve downstream task performance. Centralized NSL learning involves directly acquiring data from downstream tasks, which is not feasible for federated learning (FL). To address this limitation, we shift the focus from such a one-to-one interactive neuro-symbolic paradigm to one-to-many Federated Neuro-Symbolic Learning framework (FedNSL) with latent variables as the FL communication medium. Built on the basis of our novel reformulation of the NSL theory, FedNSL is capable of identifying and addressing rule distribution heterogeneity through a simple and effective Kullback-Leibler (KL) divergence constraint on rule distribution applicable under the FL setting. It further theoretically adjusts variational expectation maximization (V-EM) to reduce the rule search space across domains. This is the first incorporation of distribution-coupled bilevel optimization into FL. Extensive experiments based on both synthetic and real-world data demonstrate significant advantages of FedNSL compared to five state-of-the-art methods. It outperforms the best baseline by 17% and 29% in terms of unbalanced average training accuracy and unseen average testing accuracy, respectively.
