Softened Symbol Grounding for Neuro-symbolic Systems
Zenan Li, Yuan Yao, Taolue Chen, Jingwei Xu, Chun Cao, Xiaoxing Ma, Jian Lü
TL;DR
This work tackles the symbol grounding bottleneck in neuro-symbolic systems by proposing a softened grounding framework that models the input-symbol mapping as a Boltzmann distribution $Q_{\boldsymbol{\phi}}$ and gradually sharpens it through annealing. It introduces a projection-based MCMC sampling scheme, aided by SMT solvers, to efficiently explore the feasible symbol space $\mathcal{S}_{\mathbf{y}}$, and provides a convergence analysis for stochastic optimization with biased gradient estimates. A two-stage training protocol (annealing Stage I and zero-degree Stage II) is demonstrated across handwriting formula evaluation, visual Sudoku, and shortest-path tasks, showing superior performance and grounding efficiency versus state-of-the-art baselines. The results suggest that explicit, probabilistic symbol grounding coupled with targeted projection sampling can significantly enhance the integration of neural perception and symbolic reasoning, with broad implications for scalable neuro-symbolic learning and semi-supervised reasoning.
Abstract
Neuro-symbolic learning generally consists of two separated worlds, i.e., neural network training and symbolic constraint solving, whose success hinges on symbol grounding, a fundamental problem in AI. This paper presents a novel, softened symbol grounding process, bridging the gap between the two worlds, and resulting in an effective and efficient neuro-symbolic learning framework. Technically, the framework features (1) modeling of symbol solution states as a Boltzmann distribution, which avoids expensive state searching and facilitates mutually beneficial interactions between network training and symbolic reasoning;(2) a new MCMC technique leveraging projection and SMT solvers, which efficiently samples from disconnected symbol solution spaces; (3) an annealing mechanism that can escape from %being trapped into sub-optimal symbol groundings. Experiments with three representative neuro symbolic learning tasks demonstrate that, owining to its superior symbol grounding capability, our framework successfully solves problems well beyond the frontier of the existing proposals.
