Table of Contents
Fetching ...

Mapis: A Knowledge-Graph Grounded Multi-Agent Framework for Evidence-Based PCOS Diagnosis

Zanxiang He, Meng Li, Liyun Shi, Weiye Daia, Liming Nie

TL;DR

Mapis introduces a guideline-grounded, knowledge-graph–driven multi-agent framework for evidence-based PCOS diagnosis that mirrors the clinical Rotterdam workflow. By separating tasks among specialized agents and grounding reasoning in a PCOS knowledge graph, Mapis achieves superior accuracy and interpretability, outperforming traditional ML, single-agent LLMs, and generic multi-agent systems on both public and private clinical datasets. Ablation studies confirm the essential roles of the knowledge graph, strict workflow, and differential exclusion in achieving robust performance. The approach offers zero-shot diagnostic capability with verifiable evidence chains, highlighting a practical path toward reliable AI-assisted decision support in guideline-dependent endocrine disorders.

Abstract

Polycystic Ovary Syndrome (PCOS) constitutes a significant public health issue affecting 10% of reproductive-aged women, highlighting the critical importance of developing effective diagnostic tools. Previous machine learning and deep learning detection tools are constrained by their reliance on large-scale labeled data and an lack of interpretability. Although multi-agent systems have demonstrated robust capabilities, the potential of such systems for PCOS detection remains largely unexplored. Existing medical multi-agent frameworks are predominantly designed for general medical tasks, suffering from insufficient domain integration and a lack of specific domain knowledge. To address these challenges, we propose Mapis, the first knowledge-grounded multi-agent framework explicitly designed for guideline-based PCOS diagnosis. Specifically, it built upon the 2023 International Guideline into a structured collaborative workflow that simulates the clinical diagnostic process. It decouples complex diagnostic tasks across specialized agents: a gynecological endocrine agent and a radiology agent collaborative to verify inclusion criteria, while an exclusion agent strictly rules out other causes. Furthermore, we construct a comprehensive PCOS knowledge graph to ensure verifiable, evidence-based decision-making. Extensive experiments on public benchmarks and specialized clinical datasets, benchmarking against nine diverse baselines, demonstrate that Mapis significantly outperforms competitive methods. On the clinical dataset, it surpasses traditional machine learning models by 13.56%, single-agent by 6.55%, and previous medical multi-agent systems by 7.05% in Accuracy.

Mapis: A Knowledge-Graph Grounded Multi-Agent Framework for Evidence-Based PCOS Diagnosis

TL;DR

Mapis introduces a guideline-grounded, knowledge-graph–driven multi-agent framework for evidence-based PCOS diagnosis that mirrors the clinical Rotterdam workflow. By separating tasks among specialized agents and grounding reasoning in a PCOS knowledge graph, Mapis achieves superior accuracy and interpretability, outperforming traditional ML, single-agent LLMs, and generic multi-agent systems on both public and private clinical datasets. Ablation studies confirm the essential roles of the knowledge graph, strict workflow, and differential exclusion in achieving robust performance. The approach offers zero-shot diagnostic capability with verifiable evidence chains, highlighting a practical path toward reliable AI-assisted decision support in guideline-dependent endocrine disorders.

Abstract

Polycystic Ovary Syndrome (PCOS) constitutes a significant public health issue affecting 10% of reproductive-aged women, highlighting the critical importance of developing effective diagnostic tools. Previous machine learning and deep learning detection tools are constrained by their reliance on large-scale labeled data and an lack of interpretability. Although multi-agent systems have demonstrated robust capabilities, the potential of such systems for PCOS detection remains largely unexplored. Existing medical multi-agent frameworks are predominantly designed for general medical tasks, suffering from insufficient domain integration and a lack of specific domain knowledge. To address these challenges, we propose Mapis, the first knowledge-grounded multi-agent framework explicitly designed for guideline-based PCOS diagnosis. Specifically, it built upon the 2023 International Guideline into a structured collaborative workflow that simulates the clinical diagnostic process. It decouples complex diagnostic tasks across specialized agents: a gynecological endocrine agent and a radiology agent collaborative to verify inclusion criteria, while an exclusion agent strictly rules out other causes. Furthermore, we construct a comprehensive PCOS knowledge graph to ensure verifiable, evidence-based decision-making. Extensive experiments on public benchmarks and specialized clinical datasets, benchmarking against nine diverse baselines, demonstrate that Mapis significantly outperforms competitive methods. On the clinical dataset, it surpasses traditional machine learning models by 13.56%, single-agent by 6.55%, and previous medical multi-agent systems by 7.05% in Accuracy.

Paper Structure

This paper contains 51 sections, 1 equation, 6 figures, 3 tables, 1 algorithm.

Figures (6)

  • Figure 1: Schematic overview of Mapis. The architecture is organized into two core processes integrating five synergistic modules: (1) Knowledge Construction, where the KG Construction Module transforms guidelines into a structured knowledge base; and (2) Diagnostic Execution, which proceeds through the Clinical Data Preprocessing, Three-Step Assessment, Exclusion Modules, and Report Generation Module.
  • Figure 2: An example of the prompt for the Gynecological Endocrine Agent (Step 1).
  • Figure 3: Quantifying the impact of architectural ablation on F1-score.
  • Figure 4: Case study 1.
  • Figure 5: Case study 2.
  • ...and 1 more figures