Independence Is Not an Issue in Neurosymbolic AI
Håkan Karlsson Faronius, Pedro Zuidberg Dos Martires
TL;DR
The paper shows that semantic loss in neurosymbolic AI is a special case of disjunctive supervision and that deterministic bias is not an inherent consequence of conditional independence when the full semantic loss is correctly applied. Through traffic-light MNIST experiments, it demonstrates that a DSIC parameterization avoids the Winner-Take-All effect seen with softmax in disjunctive supervision, while truncated semantic loss can induce deterministic bias. The results clarify the relationship between neurosymbolic learning and disjunctive supervision, cautioning against nonstandard loss formulations and highlighting the practical benefits of DSIC in weak supervision settings. Overall, the work argues that conditional independence remains a useful, not harmful, inductive bias for NeSy systems when losses are used properly, with implications for designing interpretable, data-efficient AI systems.
Abstract
A popular approach to neurosymbolic AI is to take the output of the last layer of a neural network, e.g. a softmax activation, and pass it through a sparse computation graph encoding certain logical constraints one wishes to enforce. This induces a probability distribution over a set of random variables, which happen to be conditionally independent of each other in many commonly used neurosymbolic AI models. Such conditionally independent random variables have been deemed harmful as their presence has been observed to co-occur with a phenomenon dubbed deterministic bias, where systems learn to deterministically prefer one of the valid solutions from the solution space over the others. We provide evidence contesting this conclusion and show that the phenomenon of deterministic bias is an artifact of improperly applying neurosymbolic AI.
