Neurosymbolic Language Reasoning as Satisfiability Modulo Theory

Hyunseok Oh; Sam Stern; Youngki Lee; Matthai Philipose

Neurosymbolic Language Reasoning as Satisfiability Modulo Theory

Hyunseok Oh, Sam Stern, Youngki Lee, Matthai Philipose

TL;DR

This work develops an algorithm that integrates LLM-based constraint evaluation with satisfiability modulo theory (SMT) solving, enabling joint textual-logical reasoning and is the first that treats LLM-based reasoning as an SMT theory, extending neurosymbolic methods beyond fully formalizable domains.

Abstract

Natural language understanding requires interleaving textual and logical reasoning, yet large language models often fail to perform such reasoning reliably. Existing neurosymbolic systems combine LLMs with solvers but remain limited to fully formalizable tasks such as math or program synthesis, leaving natural documents with only partial logical structure unaddressed. We introduce Logitext, a neurosymbolic language that represents documents as natural language text constraints (NLTCs), making partial logical structure explicit. We develop an algorithm that integrates LLM-based constraint evaluation with satisfiability modulo theory (SMT) solving, enabling joint textual-logical reasoning. Experiments on a new content moderation benchmark, together with LegalBench and Super-Natural Instructions, show that Logitext improves both accuracy and coverage. This work is the first that treats LLM-based reasoning as an SMT theory, extending neurosymbolic methods beyond fully formalizable domains.

Neurosymbolic Language Reasoning as Satisfiability Modulo Theory

TL;DR

Abstract

Paper Structure (36 sections, 3 equations, 15 figures, 3 tables, 7 algorithms)

This paper contains 36 sections, 3 equations, 15 figures, 3 tables, 7 algorithms.

Introduction
Interleaved text/logic reasoning in text understanding
Compositional vs combinatorial reasoning in natural text
Logical reasoning gaps
Logitext: Language, representation and solver
The Logitext language
Natural language text constraints
Solving natural language text constraints
Evaluation
Benchmarks and setup
Results
Related work
Conclusion
Appendix
Definitions of gaps
...and 21 more sections

Figures (15)

Figure 1: Example: A content moderation policy (a) illustrates a fine-grained mix of logical and textual constraints. Combined with (b), it yields a prompt that requires compositional logical reasoning, and with (c-d), combinatorial reasoning.
Figure 2: Gaps in logical reasoning (see App. \ref{['sub:gap-defs']}) on content moderation across LLMs.
Figure 3: Content moderation policy example implemented as Logitext Document
Figure 5: End-to-End performance comparison per task.
Figure 6: NLSolver success rate vs num. iterations.
...and 10 more figures

Theorems & Definitions (2)

Definition 1: Compositional gap $\Delta_{mp}$ of LLM $m$ on prompt $p$
Definition 2: Combinatorial gap $\Delta'_{mp}$ of model LLM $m$ on prompt $p$

Neurosymbolic Language Reasoning as Satisfiability Modulo Theory

TL;DR

Abstract

Neurosymbolic Language Reasoning as Satisfiability Modulo Theory

Authors

TL;DR

Abstract

Table of Contents

Figures (15)

Theorems & Definitions (2)