MedCoAct: Confidence-Aware Multi-Agent Collaboration for Complete Clinical Decision
Hongjie Zheng, Zesheng Shi, Ping Yi
TL;DR
This work tackles the limitation of autonomous medical AI systems that operate on isolated tasks by introducing MedCoAct, a confidence-aware, dual-agent framework that simulates clinical collaboration between doctor and pharmacist agents for end-to-end diagnosis-to-prescription decisions. It introduces DrugCareQA, a 2,700-case benchmark spanning integrated diagnostic and drug-selection workflows, and demonstrates that role specialization, adaptive query planning, and confidence-aware reflection improve diagnostic and medication accuracy by about 7 percentage points over single-agent baselines. The framework relies on a two-stage, role-aware vector retrieval system and an iterative reflection mechanism to mitigate hallucinations and improve decision quality, with evidence from retrieval quality analyses and ablation studies. The results suggest substantial potential for improved telemedicine and routine clinical scenarios, while highlighting areas for further work in expanding specialties and inter-agent communication to enhance scalability and safety.
Abstract
Autonomous agents utilizing Large Language Models (LLMs) have demonstrated remarkable capabilities in isolated medical tasks like diagnosis and image analysis, but struggle with integrated clinical workflows that connect diagnostic reasoning and medication decisions. We identify a core limitation: existing medical AI systems process tasks in isolation without the cross-validation and knowledge integration found in clinical teams, reducing their effectiveness in real-world healthcare scenarios. To transform the isolation paradigm into a collaborative approach, we propose MedCoAct, a confidence-aware multi-agent framework that simulates clinical collaboration by integrating specialized doctor and pharmacist agents, and present a benchmark, DrugCareQA, to evaluate medical AI capabilities in integrated diagnosis and treatment workflows. Our results demonstrate that MedCoAct achieves 67.58\% diagnostic accuracy and 67.58\% medication recommendation accuracy, outperforming single agent framework by 7.04\% and 7.08\% respectively. This collaborative approach generalizes well across diverse medical domains, proving especially effective for telemedicine consultations and routine clinical scenarios, while providing interpretable decision-making pathways.
