Reliable Reasoning Beyond Natural Language
Nasim Borazjanizadeh, Steven T. Piantadosi
TL;DR
The paper addresses brittle reasoning in LLMs caused by sequential next-token prediction and natural-language limitations by introducing the Non-Linear Reasoning (NLR) dataset and a neurosymbolic pipeline that offloads iterative deduction to Prolog. By translating problem information into Prolog code and using a Multiple-Try inference loop, the approach achieves substantial gains on GSM8k and Navigate and near-perfect accuracy on NLR, especially as variable interdependence increases. The work demonstrates that combining neural language understanding with symbolic reasoning yields reliable, non-linear deduction and backtracking capabilities beyond what text-only prompting can achieve. This has practical implications for deploying AI systems in domains requiring robust, interpretable reasoning across complex relational structures.
Abstract
Despite their linguistic competence, Large Language Models (LLMs) often struggle to reason reliably and flexibly. To identify these shortcomings, we introduce the Non-Linear Reasoning (NLR) dataset, a collection of 55 unique, hand-designed problems that target reasoning bottlenecks arising from the sequential prediction paradigm of LLMs and the inherently linear nature of natural language. NLR tasks require iterative updates, backtracking, and reasoning across multiple parallel chains of thought but only basic arithmetic to solve. To address these limitations, we propose a neurosymbolic reasoning approach that integrates Prolog, a symbolic reasoning engine, into the inference pipeline of LLMs. This division of labor shifts the LLM's task from iterative computations to inferring all information, explicit or implied through common sense, and encoding it as logical code. Our method yields large and robust performance gains across the GSM8k and BIG-bench Navigate benchmarks and achieves near-perfect accuracy on NLR problems, maintaining robustness even as variable interdependence - the number of other variables on which the value of a single variable depends - increases.
