On the Hardness of Probabilistic Neurosymbolic Learning
Jaron Maene, Vincent Derkinderen, Luc De Raedt
TL;DR
The paper tackles the gradient differentiation challenge in probabilistic neurosymbolic learning by linking gradient computation to Weighted Model Counting (WMC), proving intractability in general but tractability during training as networks become confident. It introduces WeightME, an unbiased gradient estimator based on weighted model sampling, with PAC-style guarantees and a logarithmic number of SAT calls, offering solid theoretical and practical benefits. The authors also critically evaluate biased WMS approaches, showing they struggle to optimize even when exact inference is feasible, and provide extensive experiments on CNF benchmarks to support these findings. Overall, the work highlights the need for principled, SAT-backed gradient estimators in neurosymbolic learning and outlines clear directions for extending these results to more expressive settings.
Abstract
The limitations of purely neural learning have sparked an interest in probabilistic neurosymbolic models, which combine neural networks with probabilistic logical reasoning. As these neurosymbolic models are trained with gradient descent, we study the complexity of differentiating probabilistic reasoning. We prove that although approximating these gradients is intractable in general, it becomes tractable during training. Furthermore, we introduce WeightME, an unbiased gradient estimator based on model sampling. Under mild assumptions, WeightME approximates the gradient with probabilistic guarantees using a logarithmic number of calls to a SAT solver. Lastly, we evaluate the necessity of these guarantees on the gradient. Our experiments indicate that the existing biased approximations indeed struggle to optimize even when exact solving is still feasible.
