From Belief Entrenchment to Robust Reasoning in LLM Agents
Jihwan Oh, Minchan Jeong, Jongwoo Ko, Se-Young Yun
TL;DR
This work addresses belief entrenchment in multi agent debate by decomposing the failure into static initial bias and homogenized debate dynamics. It proposes DReaMAD, a two stage framework combining Strategic Prior Knowledge Elicitation and Perspective Diversification to correct biases and diversify reasoning paths. The new MetaNIM Arena benchmark enables rigorous evaluation of adversarial strategic reasoning under impartial game dynamics and Grundy theory. Across extensive experiments, DReaMAD yields meaningful gains in accuracy and win rates, demonstrates long chain of thought abilities without training, and generalizes to math and commonsense reasoning, highlighting a practical route to robust LLM reasoning in dynamic, interactive settings.
Abstract
Multi-Agent Debate (MAD) has emerged as a promising inference scaling method for Large Language Model (LLM) reasoning. However, it frequently suffers from belief entrenchment, where agents reinforce shared errors rather than correcting them. Going beyond merely identifying this failure, we decompose it into two distinct root causes: (1) the model's biased $\textit{static initial belief}$ and (2) $\textit{homogenized debate dynamics}$ that amplify the majority view regardless of correctness. To address these sequentially, we propose $\textbf{DReaMAD}$ $($$\textbf{D}$iverse $\textbf{Rea}$soning via $\textbf{M}$ulti-$\textbf{A}$gent $\textbf{D}$ebate with Refined Prompt$)$. Our framework first rectifies the static belief via strategic prior knowledge elicitation, then reshapes the debate dynamics by enforcing perspective diversity. Validated on our new $\textit{MetaNIM Arena}$ benchmark, $\textbf{DReaMAD}$ significantly mitigates entrenchment, achieving a +9.5\% accuracy gain over ReAct prompting and a +19.0\% higher win rate than standard MAD.
