On the Independence Assumption in Neurosymbolic Learning
Emile van Krieken, Pasquale Minervini, Edoardo M. Ponti, Antonio Vergari
TL;DR
This work exposes how the standard conditional independence assumption in neurosymbolic learning biases models toward deterministic inferences and impedes uncertainty quantification. It develops a rigorous framework based on prime implicants and cubical sets to exactly characterize the set of feasible independent distributions and the topology of semantic-loss minima, showing non-convexity and disconnection in general. By contrasting with fully expressive distributions, it demonstrates that independence can be overcome via expressive parameterisations, conditioning analyses, or mixtures, which restore representational capacity and enable more favorable optimization landscapes. The findings guide the design of more expressive neurosymbolic probabilistic models and motivate further study of tractable inference, continuous-variable constraints, and topology-informed regularisation to calibrate uncertainty.
Abstract
State-of-the-art neurosymbolic learning systems use probabilistic reasoning to guide neural networks towards predictions that conform to logical constraints over symbols. Many such systems assume that the probabilities of the considered symbols are conditionally independent given the input to simplify learning and reasoning. We study and criticise this assumption, highlighting how it can hinder optimisation and prevent uncertainty quantification. We prove that loss functions bias conditionally independent neural networks to become overconfident in their predictions. As a result, they are unable to represent uncertainty over multiple valid options. Furthermore, we prove that these loss functions are difficult to optimise: they are non-convex, and their minima are usually highly disconnected. Our theoretical analysis gives the foundation for replacing the conditional independence assumption and designing more expressive neurosymbolic probabilistic models.
