Simple and Effective Transfer Learning for Neuro-Symbolic Integration
Alessandro Daniele, Tommaso Campari, Sagar Malhotra, Luciano Serafini
TL;DR
The paper addresses a bottleneck in Neuro-Symbolic Integration (NeSy): the weak supervision provided by symbolic reasoning leads to slow convergence and to convergence to local minima. It proposes a simple, effective transfer-learning strategy: pretrain a neural model on the downstream task, then transfer its perception encoder to a NeSy model and freeze it, so that only the embedding-to-symbol mapping is learned. Across multiple NeSy methods and tasks, this approach yields faster convergence, less frequent convergence to local minima, and the ability to handle more complex perception inputs, with modest preprocessing overhead. The findings show improved accuracy and scalability, suggesting a practical path to more reliable and generalizable NeSy systems for real-world reasoning tasks.
Abstract
Deep Learning (DL) techniques have achieved remarkable successes in recent years. However, their ability to generalize and execute reasoning tasks remains a challenge. A potential solution to this issue is Neuro-Symbolic Integration (NeSy), where neural approaches are combined with symbolic reasoning. Most of these methods exploit a neural network to map perceptions to symbols and a logical reasoner to predict the output of the downstream task. These methods exhibit superior generalization capacity compared to fully neural architectures. However, they suffer from several issues, including slow convergence, learning difficulties with complex perception tasks, and convergence to local minima. This paper proposes a simple yet effective method to ameliorate these problems. The key idea involves pretraining a neural model on the downstream task. Then, a NeSy model is trained on the same task via transfer learning, where the weights of the perceptual part are injected from the pretrained network. The key observation of our work is that the neural network fails to generalize only at the level of the symbolic part while being perfectly capable of learning the mapping from perceptions to symbols. We have tested our training strategy on various SOTA NeSy methods and datasets, demonstrating consistent improvements in the aforementioned problems.
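The training strategy described above can be illustrated with a minimal, dependency-free sketch. It is a toy under stated assumptions, not the authors' implementation: `frozen_encoder` is a hypothetical stand-in for a perception encoder that has already been pretrained on the downstream task, the symbols are binary, and the symbolic part is the sum of two symbols. All function names and the synthetic data are invented for illustration. Only the small embedding-to-symbol head is updated, and it sees only the weak supervision given by the sum.

```python
import math
import random


def frozen_encoder(x):
    """Hypothetical stand-in for a pretrained perception encoder (never updated)."""
    return [x, 1.0]  # embedding = (feature, constant bias term)


def symbol_prob(head, emb):
    """Trainable head: probability that an embedding denotes symbol 1."""
    z = sum(w * e for w, e in zip(head, emb))
    return 1.0 / (1.0 + math.exp(-z))


def sum_distribution(p1, p2):
    """Symbolic part: distribution over s1 + s2 for independent binary symbols."""
    return [(1 - p1) * (1 - p2), p1 * (1 - p2) + (1 - p1) * p2, p1 * p2]


def nesy_loss(head, batch):
    """Average negative log-likelihood of the observed sums (weak supervision)."""
    total = 0.0
    for x1, x2, y in batch:
        p1 = symbol_prob(head, frozen_encoder(x1))
        p2 = symbol_prob(head, frozen_encoder(x2))
        total -= math.log(sum_distribution(p1, p2)[y] + 1e-9)
    return total / len(batch)


def train_head(batch, steps=300, lr=0.5, eps=1e-5):
    """Gradient descent on the head only, using central finite differences."""
    head = [0.0, 0.0]
    for _ in range(steps):
        grad = []
        for i in range(len(head)):
            up = head[:]
            up[i] += eps
            down = head[:]
            down[i] -= eps
            grad.append((nesy_loss(up, batch) - nesy_loss(down, batch)) / (2 * eps))
        head = [w - lr * g for w, g in zip(head, grad)]
    return head


def make_batch(n, rng):
    """Toy data: symbol 0 is perceived near -1, symbol 1 near +1; only the sum is labelled."""
    batch = []
    for _ in range(n):
        s1, s2 = rng.randint(0, 1), rng.randint(0, 1)
        x1 = (2 * s1 - 1) + rng.gauss(0, 0.2)
        x2 = (2 * s2 - 1) + rng.gauss(0, 0.2)
        batch.append((x1, x2, s1 + s2))
    return batch


rng = random.Random(0)
batch = make_batch(200, rng)
head = train_head(batch)  # the encoder stays fixed; only `head` is learned
```

In a real system the encoder would be a deep network whose weights are copied from the pretrained model, and the head would be trained with automatic differentiation through the chosen NeSy method rather than with finite differences; the sketch only shows where the frozen and trainable parts sit relative to the symbolic reasoner.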
