AutoReason: Automatic Few-Shot Reasoning Decomposition
Arda Sevinc, Abdurrahman Gumus
TL;DR
AutoReason proposes automatic generation of Chain-of-Thought rationales to decompose implicit queries into explicit sub-questions, enabling a zero-shot prompt to behave like per-query few-shot reasoning traces. The system uses a two-stage process where a strong LLM (GPT-4) generates rationales that guide a weaker LLM (GPT-3.5-Turbo) to produce final answers, improving reasoning on StrategyQA and showing mixed results on HotpotQA. Results indicate substantial gains on StrategyQA, demonstrating the potential for per-query rationale generation to enhance interpretability and accuracy, while highlighting task-dependent limitations. The work contributes a scalable methodology for interpretability-driven prompting and provides release of the code to foster further research.
Abstract
Chain of Thought (CoT) was introduced in recent research as a method for improving step-by-step reasoning in Large Language Models. However, CoT has limited applications such as its need for hand-crafted few-shot exemplar prompts and no capability to adjust itself to different queries. In this work, we propose a system to automatically generate rationales using CoT. Our method improves multi-step implicit reasoning capabilities by decomposing the implicit query into several explicit questions. This provides interpretability for the model, improving reasoning in weaker LLMs. We test our approach with two Q\&A datasets: StrategyQA and HotpotQA. We show an increase in accuracy with both, especially on StrategyQA. To facilitate further research in this field, the complete source code for this study has been made publicly available on GitHub: https://github.com/miralab-ai/autoreason.
