Improved Logical Reasoning of Language Models via Differentiable Symbolic Programming

Hanlin Zhang; Jiani Huang; Ziyang Li; Mayur Naik; Eric Xing

Improved Logical Reasoning of Language Models via Differentiable Symbolic Programming

Hanlin Zhang, Jiani Huang, Ziyang Li, Mayur Naik, Eric Xing

TL;DR

<3-5 sentence high-level summary> The paper tackles the poor logical reasoning capabilities of pre-trained LMs by introducing DSR-LM, a differentiable neuro-symbolic framework in which a perception LM extracts probabilistic relations and a differentiable symbolic engine performs deductive reasoning with learned rules. It adds semantic loss via integrity constraints and trains rule weights jointly with the LM, enabling end-to-end optimization and interpretable rule induction. Empirical results on CLUTRR and DBpedia-INF show substantial gains in deductive accuracy and stronger generalization to long reasoning chains, outperforming a broad set of baselines including GPT-3 variants. The approach demonstrates the value of integrating differentiable symbolic programming with neural perception to improve robustness and interpretability in reasoning tasks.

Abstract

Pre-trained large language models (LMs) struggle to perform logical reasoning reliably despite advances in scale and compositionality. In this work, we tackle this challenge through the lens of symbolic programming. We propose DSR-LM, a Differentiable Symbolic Reasoning framework where pre-trained LMs govern the perception of factual knowledge, and a symbolic module performs deductive reasoning. In contrast to works that rely on hand-crafted logic rules, our differentiable symbolic reasoning framework efficiently learns weighted rules and applies semantic loss to further improve LMs. DSR-LM is scalable, interpretable, and allows easy integration of prior knowledge, thereby supporting extensive symbolic programming to robustly derive a logical conclusion. The results of our experiments suggest that DSR-LM improves the logical reasoning abilities of pre-trained language models, resulting in a significant increase in accuracy of over 20% on deductive reasoning benchmarks. Furthermore, DSR-LM outperforms a variety of competitive baselines when faced with systematic changes in sequence length.

Improved Logical Reasoning of Language Models via Differentiable Symbolic Programming

TL;DR

Abstract

Paper Structure (32 sections, 6 equations, 6 figures, 9 tables)

This paper contains 32 sections, 6 equations, 6 figures, 9 tables.

Introduction
Related Work
Methodology
Problem Formulation
Methodology Overview
Relation Extraction
Differentiable Symbolic Inference
Logical deduction.
Probability propagation.
Rule learning.
Semantic loss and integrity constraints.
Experiments
Datasets
Experimental Setup
Implementation.
...and 17 more sections

Figures (6)

Figure 1: Overview of DSR-LM with a motivating example where "Anne is the niece of Dorothy" should be logically inferred from the context. We abbreviate the names with their first initials in the relational symbols.
Figure 2: The Scallop program used in the CLUTRR reasoning task.
Figure 3: DSR-LM's performance on CLUTRR compared with various baselines
Figure 3: DBpedia-INF generalization evaluation under different test reasoning length. Models are trained on 10K reasoning length $k=0$ sequences, and tested on sequences of reasoning length $k=[0, 5]$.
Figure 4: Systematic generalization performance comparison on CLUTRR dataset. Models except GPT-3-ZS*, GPT-3-FS are trained (or fine-tuned) on $k \in \{2, 3\}$. All models are tested on $k\in\{2,\dots,10\}$.
...and 1 more figures

Improved Logical Reasoning of Language Models via Differentiable Symbolic Programming

TL;DR

Abstract

Improved Logical Reasoning of Language Models via Differentiable Symbolic Programming

Authors

TL;DR

Abstract

Table of Contents

Figures (6)