NELLIE: A Neuro-Symbolic Inference Engine for Grounded, Compositional, and Explainable Reasoning
Nathaniel Weir, Peter Clark, Benjamin Van Durme
TL;DR
Nellie tackles the challenge of interpretable QA by grounding answers in an external NL fact corpus through end-to-end entailment-tree proofs, addressing both interpretability and hallucination in large language models. It combines a Prolog-style backward-chaining prover with neural predicates, retrieval-guided rule generation, and template-conditioned decomposition to produce grounded, explainable proofs. The architecture, evaluated on WorldTree and EntailmentBank, matches or surpasses similar-sized baselines and demonstrates domain generalization to OpenBookQA when common knowledge is incorporated, highlighting the practical value of jointly leveraging neural reasoning and symbolic inference. Collectively, Nellie demonstrates a scalable, grounded approach to explainable QA that bridges modern neural methods with traditional symbolic reasoning, enabling robust, human-interpretable explanations grounded in textual knowledge.
Abstract
Our goal is a modern approach to answering questions via systematic reasoning where answers are supported by human interpretable proof trees grounded in an NL corpus of authoritative facts. Such a system would help alleviate the challenges of interpretability and hallucination with modern LMs, and the lack of grounding of current explanation methods (e.g., Chain-of-Thought). This paper proposes a new take on Prolog-based inference engines, where we replace handcrafted rules with a combination of neural language modeling, guided generation, and semiparametric dense retrieval. Our implementation, NELLIE, is the first system to demonstrate fully interpretable, end-to-end grounded QA as entailment tree proof search, going beyond earlier work explaining known-to-be-true facts from text. In experiments, NELLIE outperforms a similar-sized state-of-the-art reasoner [Tafjord et al., 2022] while producing knowledge-grounded explanations. We also find NELLIE can exploit both semi-structured and NL text corpora to guide reasoning. Together these suggest a new way to jointly reap the benefits of both modern neural methods and traditional symbolic reasoning.
