DCR: Divide-and-Conquer Reasoning for Multi-choice Question Answering with LLMs
Zijie Meng, Yan Zhang, Zhaopeng Feng, Zuozhu Liu
TL;DR
This work tackles the uniform-processing bias in LLM-based MCQ reasoning by introducing Divide and Conquer Reasoning (DCR), which partitions questions according to a confidence score $\mathcal{CS}$ estimated from multiple Zero-Shot-CoT runs and then applies Filter Choices based Reasoning (FCR) to the low-$\mathcal{CS}$ subset. The method reduces inference cost to about 85% of the state of the art while delivering a solid average improvement of $1.56\%$ across nine diverse datasets spanning arithmetic, commonsense, and logic tasks, and proves effective across multiple LLMs. Key contributions include pioneering dataset-level division for LLM reasoning, demonstrating cost-accuracy tradeoffs through $\mathcal{CS}$-based partitioning and FCR, and validating generalization to cloze-style data like GSM8K. The results highlight practical gains in efficiency and accuracy, with insights into the relationship between confidence, distractors, and optimal allocation of reasoning resources.
Abstract
Large language models (LLMs) have shown impressive performance in reasoning benchmarks with the emergence of Chain-of-Thought (CoT), particularly in multi-choice question (MCQ). However, current works equally resolve questions regardless of the problem-solving difficulty, leading to an excessive focus on simple items while insufficient attention on intricate ones. To address this challenge, we propose a simple yet effective strategy, Divide and Conquer Reasoning (DCR), to enhance the reasoning capability of LLMs for MCQs, as inspired by human beings using heuristics to first categorize tasks and then handle them separately. In particular, we first categorize questions into two subsets based on confidence score ($\mathcal{CS}$), which is estimated by statistical frequency of generated answers. Subsequently, we propose Filter Choices based Reasoning (FCR) to improve model performance on MCQs with low ($\mathcal{CS}$). Our experiments demonstrate that the proposed strategy only costs 85% of SOTA, while still achieves average accuracy improvement of 1.56% across nine datasets including arithmetic, commonsense, and logic reasoning tasks. The code is at \url{https://github.com/AiMijie/Divide-and-Conquer}
