DynaThink: Fast or Slow? A Dynamic Decision-Making Framework for Large Language Models
Jiabao Pan, Yan Zhang, Chen Zhang, Zuozhu Liu, Hongwei Wang, Haizhou Li
TL;DR
This work tackles the efficiency–accuracy tension in large language model reasoning by introducing DynaThink, a dynamic decision-making framework that routes problems to Fast or Slow inference paths. The routing rests on two criteria: consistency verification (requiring $> \tfrac{1}{2}$ of votes) and reasoning-complexity verification (favoring fewer steps), enabling high-confidence, low-cost solutions when possible and more thorough multi-path reasoning when needed. Across six reasoning benchmarks and multiple LLMs, DynaThink improves both accuracy and efficiency relative to a Self-Consistency baseline, with notable gains in zero-shot and few-shot settings and compatible gains when combined with SelfCheck. The approach offers practical value for resource-constrained deployments and highlights a path toward more adaptive, cost-aware LLM reasoning.
Abstract
Large language models (LLMs) have demonstrated emergent capabilities across diverse reasoning tasks via popular Chains-of-Thought (COT) prompting. However, such a simple and fast COT approach often encounters limitations in dealing with complicated problems, while a thorough method, which considers multiple reasoning pathways and verifies each step carefully, results in slower inference. This paper addresses the challenge of enabling LLMs to autonomously select between fast and slow inference methods, thereby optimizing both efficiency and effectiveness. We introduce a dynamic decision-making framework that categorizes tasks into two distinct pathways: 'Fast', designated for tasks where the LLM quickly identifies a high-confidence solution, and 'Slow', allocated for tasks that the LLM perceives as complex and for which it has low confidence in immediate solutions as well as requiring more reasoning paths to verify. Experiments on five popular reasoning benchmarks demonstrated the superiority of the DynaThink over baselines.
