MedCoT: Medical Chain of Thought via Hierarchical Expert

Jiaxiang Liu; Yuan Wang; Jiawei Du; Joey Tianyi Zhou; Zuozhu Liu

TL;DR

MedCoT tackles the lack of interpretable reasoning and robustness in Med-VQA by introducing a hierarchical expert verification pipeline that cascades through an Initial Specialist, a Follow-up Specialist, and a Diagnostic Specialist built on a sparse Mixture of Experts (MoE). The approach couples step-by-step multimodal reasoning with a multimodal T5 backbone to produce justifications alongside answers, with the final answer settled by expert voting. Empirical results on four standard Med-VQA datasets show state-of-the-art performance and enhanced interpretability, with significant gains over strong baselines and clear reasoning traces. By embedding explicit reasoning paths and multi-expert consensus into medical visual question answering, the work aims to strengthen clinical trust and diagnostic reliability.

Abstract

Artificial intelligence has advanced in Medical Visual Question Answering (Med-VQA), but prevalent research tends to focus on the accuracy of the answers, often overlooking the reasoning paths and interpretability, which are crucial in clinical settings. In addition, current Med-VQA algorithms typically rely on a single model and lack the robustness needed for real-world medical diagnostics, which usually requires collaborative expert evaluation. To address these shortcomings, this paper presents MedCoT, a novel hierarchical expert verification reasoning chain method designed to enhance interpretability and accuracy in biomedical imaging inquiries. MedCoT is predicated on two principles: the necessity for explicit reasoning paths in Med-VQA and the requirement for multi-expert review to formulate accurate conclusions. The methodology proceeds in three stages: an Initial Specialist proposes diagnostic rationales, a Follow-up Specialist validates these rationales, and finally the locally deployed Diagnostic Specialist reaches a consensus through a vote among a sparse Mixture of Experts and provides the definitive diagnosis. Experimental evaluations on four standard Med-VQA datasets demonstrate that MedCoT surpasses existing state-of-the-art approaches, providing significant improvements in performance and interpretability.
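
The three-stage cascade described in the abstract can be illustrated with a small toy sketch. This is not the authors' implementation: the abstract does not specify how experts are gated or how votes are counted, so the softmax gate, the top-k selection, the weighted vote, and every name below (`sparse_moe_vote`, `medcot_pipeline`, the stub specialists) are assumptions made purely for illustration.

```python
import math
from collections import Counter


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def sparse_moe_vote(gate_scores, expert_answers, k=2):
    """Hypothetical sparse vote: only the top-k gated experts contribute.

    Each selected expert adds its gate weight to the tally of the answer
    it proposes; the answer with the largest total weight wins.
    """
    weights = softmax(gate_scores)
    # Sparsity: keep the indices of the k highest-weighted experts only.
    top_k = sorted(range(len(weights)), key=lambda i: weights[i], reverse=True)[:k]
    tally = Counter()
    for i in top_k:
        tally[expert_answers[i]] += weights[i]
    answer = max(tally, key=tally.get)
    return answer, top_k


def medcot_pipeline(question, initial_specialist, followup_specialist,
                    diagnostic_specialist):
    """Cascade: propose a rationale, validate it, then diagnose."""
    rationale = initial_specialist(question)
    verified = followup_specialist(question, rationale)
    return diagnostic_specialist(question, verified)


# Stub specialists (assumed placeholders, standing in for real models).
def initial(question):
    return "rationale: opacity in the left lung"


def followup(question, rationale):
    return rationale + " [verified]"


def diagnostic(question, verified_rationale):
    gate_scores = [2.0, 0.1, 1.5, -1.0]        # assumed gate outputs
    expert_answers = ["yes", "no", "yes", "no"]  # assumed expert outputs
    answer, _ = sparse_moe_vote(gate_scores, expert_answers, k=2)
    return answer, verified_rationale


result = medcot_pipeline("Is there an abnormality?", initial, followup, diagnostic)
```

In this sketch the final result carries both the voted answer and the validated rationale, mirroring the abstract's emphasis on returning a justification together with the diagnosis. A real system would replace the stubs with model calls and a learned gate.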
