Multi-LLM Collaboration for Medication Recommendation

Huascar Sanchez; Briland Hitaj; Jules Bergmann; Linda Briesemeister

TL;DR

The paper tackles unreliable LLM-driven medication recommendations from unstructured clinical notes by introducing a Chemistry-inspired multi-LLM collaboration framework. It models interaction dynamics through a two-stage generation-evaluation process to achieve efficient, stable, and calibrated ensembles. Evaluation on synthetic clinical vignettes shows that Chemistry-guided ensembles provide competitive accuracy with substantially improved efficiency and robust calibration compared to baselines. The work demonstrates feasibility and sets directions for applying interaction-aware ensembles in real-world clinical decision support, including richer data and retrieval-augmented grounding.

Abstract

As healthcare increasingly turns to AI for scalable and trustworthy clinical decision support, ensuring reliability in model reasoning remains a critical challenge. Individual large language models (LLMs) are susceptible to hallucinations and inconsistency, whereas naive ensembles of models often fail to deliver stable and credible recommendations. Building on our previous work on LLM Chemistry, which quantifies the collaborative compatibility among LLMs, we apply this framework to improve the reliability of medication recommendation from brief clinical vignettes. Our approach leverages multi-LLM collaboration guided by Chemistry-inspired interaction modeling, enabling ensembles that are effective (exploiting complementary strengths), stable (producing consistent quality), and calibrated (minimizing interference and error amplification). We evaluate our Chemistry-based multi-LLM collaboration strategy on real-world clinical scenarios to investigate whether such interaction-aware ensembles can generate credible, patient-specific medication recommendations. Preliminary results are encouraging, suggesting that LLM Chemistry-guided collaboration may offer a promising path toward reliable and trustworthy AI assistants in clinical practice.
