Recursive Decomposition of Logical Thoughts: Framework for Superior Reasoning and Knowledge Propagation in Large Language Models
Kaleem Ullah Qasim, Jiashu Zhang, Tariq Alsahfi, Ateeq Ur Rehman Butt
TL;DR
RDoLT tackles core weaknesses in large language model reasoning by integrating three components: recursive task decomposition into Easy/Intermediate/Final levels, a four-feature thought scoring system, and a Knowledge Propagation Module that preserves both strong and weak thoughts across reasoning stages. The framework combines elements from CoT, CoT-SC, and graph-based reasoning while maximizing flexibility and error recovery through cross-stage knowledge propagation. Empirical results across GSM8K, SVAMP, MultiArith, LastLetterConcatenation, and Gaokao 2023 Math show consistent improvements over state-of-the-art prompting methods, with notable gains on open models and ChatGPT-4o, and robust performance across varied thresholds and thought-quantity configurations. These findings suggest RDoLT offers a scalable, generalizable approach to complex reasoning tasks, with open-source releases to foster broader adoption and further research.
Abstract
Enhancing the reasoning capabilities of Large Language Models remains a critical challenge in artificial intelligence. We introduce RDoLT, Recursive Decomposition of Logical Thought prompting, a novel framework that significantly boosts LLM reasoning performance. RDoLT is built on three key innovations: (1) recursively breaking down complex reasoning tasks into sub-tasks of progressive complexity; (2) employing an advanced selection and scoring mechanism to identify the most promising reasoning thoughts; and (3) integrating a knowledge propagation module that mimics human learning by keeping track of strong and weak thoughts for information propagation. Our approach was evaluated across multiple benchmarks, including GSM8K, SVAMP, MultiArith, LastLetterConcatenation, and Gaokao2023 Math. The results demonstrate that RDoLT consistently outperforms existing state-of-the-art techniques, achieving a 90.98 percent accuracy on GSM8K with ChatGPT-4, surpassing state-of-the-art techniques by 6.28 percent. Similar improvements were observed on other benchmarks, with accuracy gains ranging from 5.5 percent to 6.75 percent. These findings highlight RDoLT's potential to advance prompt engineering, offering a more effective and generalizable approach to complex reasoning tasks.
