DebUnc: Improving Large Language Model Agent Communication With Uncertainty Metrics
Luke Yoffe, Alfonso Amayuelas, William Yang Wang
TL;DR
DebUnc tackles the problem of overconfident, incorrect LLM outputs in multi-agent debates by quantifying agent uncertainty and communicating it to peers. It introduces two uncertainty communication strategies—prompt-based confidence signaling and a novel attention-scaling mechanism that biases token generation toward more confident agents. Empirical results across multiple LLMs and benchmarks show that attention scaling, particularly Attention-All, delivers the strongest improvements and scales with the quality of the uncertainty metric (Oracle being an idealized bound). The work highlights a practical path to more reliable cooperative reasoning in LLM systems and provides a foundation for developing more robust uncertainty metrics.
Abstract
Multi-agent debates have been introduced to improve the accuracy of Large Language Models (LLMs) by having multiple agents discuss solutions to a problem over several rounds of debate. However, models often generate incorrect yet confident-sounding responses, which can mislead others. This issue arises partly because agents do not consider how confident their peers are. To address this, we propose DebUnc, a debate framework that uses uncertainty metrics to assess agent confidence. Confidence is then conveyed through a modified attention mechanism that adjusts token weights, or through textual prompts. Evaluations across benchmarks show that attention-based methods are particularly effective and that performance continues to improve as uncertainty estimation becomes more reliable. The code is available at https://github.com/lukeyoffe/debunc.
