Aristotle: Mastering Logical Reasoning with A Logic-Complete Decompose-Search-Resolve Framework
Jundong Xu, Hao Fei, Meng Luo, Qian Liu, Liangming Pan, William Yang Wang, Preslav Nakov, Mong-Li Lee, Wynne Hsu
TL;DR
This work defines a logic-complete reasoning framework, Aristotle, that tightly couples symbolic logic with LLM-driven reasoning across decomposition, search, and resolution to address both efficacy and efficiency gaps in logical tasks. The architecture comprises a Translator, Decomposer, Search Router, and Resolver, enabling a full decompose-search-resolve cycle that operates on CNF-normalized premises via a proof-by-contradiction scheme. Empirical results across ProntoQA, ProofWriter, and LogicNLI show that Aristotle consistently outperforms state-of-the-art baselines, with especially large gains on complex logical tasks and demonstrated generalizability across models such as GPT-4, GPT-4o, Claude, and LLaMA. The findings highlight near-perfect one-step reasoning accuracy in the Resolver, reduced search errors, and stable cost scaling with reasoning depth, underscoring the practical impact of integrating symbolic logic into end-to-end LLM reasoning. The work also provides open-source code to encourage adoption and further research in reliable, scalable logical reasoning with LLMs.
Abstract
In the context of large language models (LLMs), current advanced reasoning methods have made impressive strides in various reasoning tasks. However, when it comes to logical reasoning tasks, major challenges remain in both efficacy and efficiency. This is rooted in the fact that these systems fail to fully leverage the inherent structure of logical tasks throughout the reasoning processes such as decomposition, search, and resolution. To address this, we propose a logic-complete reasoning framework, Aristotle, with three key components: Logical Decomposer, Logical Search Router, and Logical Resolver. In our framework, symbolic expressions and logical rules are comprehensively integrated into the entire reasoning process, significantly alleviating the bottlenecks of logical reasoning, i.e., reducing sub-task complexity, minimizing search errors, and resolving logical contradictions. The experimental results on several datasets demonstrate that Aristotle consistently outperforms state-of-the-art reasoning frameworks in both accuracy and efficiency, particularly excelling in complex logical reasoning scenarios. We will open-source all our code at https://llm-symbol.github.io/Aristotle/.
