Multi-Agent Collaborative Intelligence: Dual-Dial Control for Reliable LLM Reasoning
Edward Y. Chang, Ethan Y. Chang
TL;DR
MACI presents a dual-dial controller for multi-agent LLM reasoning that separately gates evidence quality and controls interaction intensity, with a moderator that stops debates when progress plateaus. The method comes with theory-lite guarantees (nonincreasing dispersion and plateau termination in $O(1/cepsilon)$ rounds, and $ ilde{O}(\\sqrt{KT})$ no-regret for a budgeted scheduler) and demonstrates improved accuracy and calibration while reducing tokens in clinical diagnosis and news-bias tasks. A cross-family evaluator (CRIT) provides robust, non-oracle judgment that supports soft weighting and stopping, with stability validated under judge swaps when high-capability models are used. MACI also translates residual uncertainty into precision RAG plans to guide what to retrieve next, showing portability across domains without domain-specific tuning. Overall, MACI reframes multi-agent debate as a budget-aware, measurable, and provably terminating controller that balances exploration and consolidation through principled information-theoretic signals.
Abstract
Multi-agent debate often wastes compute by using a fixed adversarial stance, aggregating without deliberation, or stopping on heuristics. We introduce MACI, an active controller with two independent dials that decouple information from behavior: an information dial that gates evidence by quality, and a behavior dial that schedules contentiousness from exploration to consolidation. A moderator tracks disagreement, overlap, evidence quality, and argument quality, and halts when gains plateau. We provide theory-lite guarantees for nonincreasing dispersion and provable termination, with a budget-feasible scheduler. Across clinical diagnosis and news-bias tasks, MACI improves accuracy and calibration while reducing tokens, and converts residual uncertainty into precision RAG plans that specify what to retrieve next. We use a cross-family LLM judge (CRIT) as a conservative soft weight and stop signal, validated for order invariance and judge-swap stability; stability depends on using high-capability judges. MACI turns debate into a budget-aware, measurable, and provably terminating controller.
