Neurosymbolic Decision Trees
Matthias Möller, Arvid Norlander, Pedro Zuidberg Dos Martires, Luc De Raedt
TL;DR
The paper tackles learning both the symbolic structure and neural parameters in neurosymbolic AI by introducing neurosymbolic decision trees (NDTs) and the NeuID3 learning algorithm. It merges top-down decision-tree induction with neural probabilistic logic (DeepProbLog) to support symbolic and subsymbolic inputs and to leverage background knowledge. Empirical results across UCI datasets, MNIST/EMNIST-based subsymbolic tasks, and Eleusis demonstrate that NLDTs can outperform purely neural approaches, particularly when background knowledge is provided and neural tests are reused. This work advances data-efficient, modular neurosymbolic models with interpretable structure, offering a natural path toward continual learning and knowledge-driven AI systems.
Abstract
Neurosymbolic (NeSy) AI studies the integration of neural networks (NNs) and symbolic reasoning based on logic. Usually, NeSy techniques focus on learning the neural, probabilistic and/or fuzzy parameters of NeSy models. Learning the symbolic or logical structure of such models has, so far, received less attention. We introduce neurosymbolic decision trees (NDTs), as an extension of decision trees together with a novel NeSy structure learning algorithm, which we dub NeuID3. NeuID3 adapts the standard top-down induction of decision tree algorithms and combines it with a neural probabilistic logic representation, inherited from the DeepProbLog family of models. The key advantage of learning NDTs with NeuID3 is the support of both symbolic and subsymbolic data (such as images), and that they can exploit background knowledge during the induction of the tree structure, In our experimental evaluation we demonstrate the benefits of NeSys structure learning over more traditonal approaches such as purely data-driven learning with neural networks.
