Deep Symbolic Learning: Discovering Symbols and Rules from Perceptions
Alessandro Daniele, Tommaso Campari, Sagar Malhotra, Luciano Serafini
TL;DR
Deep Symbolic Learning (DSL) presents a Neuro-Symbolic framework that jointly learns perception functions and internal symbolic representations, addressing the symbol grounding problem within a fully differentiable pipeline. It introduces policy-based discrete symbol selection to embed interpretable symbols and learn symbolic rules from data, supervised only on the composed NeSy function. DSL supports both direct and recurrent NeSy-functions and learns the symbolic function g via a learnable weight tensor, enabling gradient flow through discrete choices. Empirical results on MNIST-based tasks (sum, parity, and multi-digit sum) demonstrate competitive accuracy, effective symbol grounding, and strong generalization to longer sequences and transfer to related tasks, with scalable inference for very long inputs. This approach offers a unified, end-to-end method for discovering and grounding symbols and rules from perception without heavy prior biases on the symbolic structure.
Abstract
Neuro-Symbolic (NeSy) integration combines symbolic reasoning with Neural Networks (NNs) for tasks requiring perception and reasoning. Most NeSy systems rely on continuous relaxation of logical knowledge, and no discrete decisions are made within the model pipeline. Furthermore, these methods assume that the symbolic rules are given. In this paper, we propose Deep Symbolic Learning (DSL), a NeSy system that learns NeSy-functions, i.e., the composition of a (set of) perception functions which map continuous data to discrete symbols, and a symbolic function over the set of symbols. DSL learns simultaneously the perception and symbolic functions while being trained only on their composition (NeSy-function). The key novelty of DSL is that it can create internal (interpretable) symbolic representations and map them to perception inputs within a differentiable NN learning pipeline. The created symbols are automatically selected to generate symbolic functions that best explain the data. We provide experimental analysis to substantiate the efficacy of DSL in simultaneously learning perception and symbolic functions.
