ChatLogic: Integrating Logic Programming with Large Language Models for Multi-Step Reasoning
Zhongsheng Wang, Jiamou Liu, Qiming Bao, Hongfei Rong, Jingfeng Zhang
TL;DR
ChatLogic tackles the challenge of robust multi-step reasoning in LLMs by integrating a symbolic logic engine via pyDatalog, converting natural language problems into logic programs, and executing them locally. The framework combines semantic and syntax corrections with a Mix-shot Chain of Thought prompting to guide LLMs toward correct symbolic representations and reliable code generation. Across PARARULE-Plus and CONCEPTRULES datasets, ChatLogic consistently improves reasoning accuracy and code executability, with notable gains for smaller models and high-depth tasks. This approach enhances transparency, traceability, and practicality of LLM-based reasoning in real-world multi-step tasks.
Abstract
Large language models (LLMs) such as ChatGPT and GPT-4 have demonstrated impressive capabilities in various generative tasks. However, their performance is often hampered by limitations in accessing and leveraging long-term memory, leading to specific vulnerabilities and biases, especially during long interactions. This paper introduces ChatLogic, an innovative framework specifically targeted at LLM reasoning tasks that can enhance the performance of LLMs in multi-step deductive reasoning tasks by integrating logic programming. In ChatLogic, the language model plays a central role, acting as a controller and participating in every system operation stage. We propose a novel method of converting logic problems into symbolic integration with an inference engine. This approach leverages large language models' situational understanding and imitation skills and uses symbolic memory to enhance multi-step deductive reasoning capabilities. Our results show that the ChatLogic framework significantly improves the multi-step reasoning capabilities of LLMs. The source code and data are available at \url{https://github.com/Strong-AI-Lab/ChatLogic}
