Table of Contents
Fetching ...

Beyond Direct Diagnosis: LLM-based Multi-Specialist Agent Consultation for Automatic Diagnosis

Haochun Wang, Sendong Zhao, Zewen Qiang, Nuwa Xi, Bing Qin, Ting Liu

TL;DR

The paper addresses automatic diagnosis by modeling real-world clinical workflows as multi-specialist consultations. It introduces the Agent-derived Multi-Specialist Consultation (AMSC) framework, which uses tuning-free LLMs as a general practitioner and disease-specific specialist agents, framing diagnosis as a hierarchical MCQA and fusing predictions with a self-attention based Adaptive Probability Distribution Fusion (APDF) mechanism; final predictions are computed as $p_{disease_i} = \frac{\texttt{LLM}(q,\text{opts}; i)}{\sum_{j} \texttt{LLM}(q,\text{opts}; j)}$ and refined through $Q,K,V$ projections to produce the final $p_{disease}$. The approach achieves up to a 3.2% accuracy boost over strong baselines across three real-world datasets, while dramatically reducing training time (up to 95.5%) and trainable parameters (up to 99.99%), and provides insight into the role of implicit symptoms and knowledge alignment. By leveraging open-source LLMs deployed privately, AMSC offers a privacy-conscious, scalable path toward practical AI-assisted diagnosis, with explicit symptoms shown to carry substantial diagnostic signal and implicit symptoms offering limited additional benefit in some configurations.

Abstract

Automatic diagnosis is a significant application of AI in healthcare, where diagnoses are generated based on the symptom description of patients. Previous works have approached this task directly by modeling the relationship between the normalized symptoms and all possible diseases. However, in the clinical diagnostic process, patients are initially consulted by a general practitioner and, if necessary, referred to specialists in specific domains for a more comprehensive evaluation. The final diagnosis often emerges from a collaborative consultation among medical specialist groups. Recently, large language models have shown impressive capabilities in natural language understanding. In this study, we adopt tuning-free LLM-based agents as medical practitioners and propose the Agent-derived Multi-Specialist Consultation (AMSC) framework to model the diagnosis process in the real world by adaptively fusing probability distributions of agents over potential diseases. Experimental results demonstrate the superiority of our approach compared with baselines. Notably, our approach requires significantly less parameter updating and training time, enhancing efficiency and practical utility. Furthermore, we delve into a novel perspective on the role of implicit symptoms within the context of automatic diagnosis.

Beyond Direct Diagnosis: LLM-based Multi-Specialist Agent Consultation for Automatic Diagnosis

TL;DR

The paper addresses automatic diagnosis by modeling real-world clinical workflows as multi-specialist consultations. It introduces the Agent-derived Multi-Specialist Consultation (AMSC) framework, which uses tuning-free LLMs as a general practitioner and disease-specific specialist agents, framing diagnosis as a hierarchical MCQA and fusing predictions with a self-attention based Adaptive Probability Distribution Fusion (APDF) mechanism; final predictions are computed as and refined through projections to produce the final . The approach achieves up to a 3.2% accuracy boost over strong baselines across three real-world datasets, while dramatically reducing training time (up to 95.5%) and trainable parameters (up to 99.99%), and provides insight into the role of implicit symptoms and knowledge alignment. By leveraging open-source LLMs deployed privately, AMSC offers a privacy-conscious, scalable path toward practical AI-assisted diagnosis, with explicit symptoms shown to carry substantial diagnostic signal and implicit symptoms offering limited additional benefit in some configurations.

Abstract

Automatic diagnosis is a significant application of AI in healthcare, where diagnoses are generated based on the symptom description of patients. Previous works have approached this task directly by modeling the relationship between the normalized symptoms and all possible diseases. However, in the clinical diagnostic process, patients are initially consulted by a general practitioner and, if necessary, referred to specialists in specific domains for a more comprehensive evaluation. The final diagnosis often emerges from a collaborative consultation among medical specialist groups. Recently, large language models have shown impressive capabilities in natural language understanding. In this study, we adopt tuning-free LLM-based agents as medical practitioners and propose the Agent-derived Multi-Specialist Consultation (AMSC) framework to model the diagnosis process in the real world by adaptively fusing probability distributions of agents over potential diseases. Experimental results demonstrate the superiority of our approach compared with baselines. Notably, our approach requires significantly less parameter updating and training time, enhancing efficiency and practical utility. Furthermore, we delve into a novel perspective on the role of implicit symptoms within the context of automatic diagnosis.
Paper Structure (20 sections, 6 equations, 5 figures, 4 tables)

This paper contains 20 sections, 6 equations, 5 figures, 4 tables.

Figures (5)

  • Figure 1: An example of data for automatic diagnosis from online consulting platforms.
  • Figure 2: Scenarios of three medical diagnostic processes. A. Individual Practitioner Consultation: a singular general practitioner or specialist formulates the diagnosis; B. Practitioner Group Consultation: a group of professionals collaboratively arriving at a diagnostic conclusion; C. Agent-based Group Consultation: the diagnosis is derived from the decision fusion from multiple agent-based specialists.
  • Figure 3: Illustration of LLM-derived General Practitioner, Agent-derived Specialist and Agent-derived Multiple Specialist Consultation.
  • Figure 4: Recall for the four agent-derived specialists on the diseases of the MuZhi-4 dataset with corresponding knowledge for a specific disease and for an LLM-derived general practitioner without knowledge ("None" in the figure).
  • Figure 5: Accuracy for automatic diagnosis with various decision fusion techniques.