Right Prediction, Wrong Reasoning: Uncovering LLM Misalignment in RA Disease Diagnosis
Umakanta Maharana, Sarthak Verma, Avarna Agarwal, Prakashini Mruthyunjaya, Dwarikanath Mahapatra, Sakir Ahmed, Murari Mandal
TL;DR
The paper investigates how large language models perform in diagnosing Rheumatoid Arthritis (RA) using real-world patient data, revealing a critical misalignment: models can predict RA with high accuracy while generating flawed or clinically unsound explanations. It introduces the PreRAID dataset and a retrieval-augmented framework to evaluate multiple LLMs across various knowledge-base sizes and conversational architectures, including single and multi-agent setups. Although diagnostic accuracy often exceeds 90% (with some configurations approaching 100%), expert reviews show that the reasoning behind these predictions is frequently incorrect, exposing a gap between performance and interpretability. The findings highlight the importance of robust, medically valid explanations for clinical use and call for advances in prompts, reasoning verification, and explanatory mechanisms to ensure safe deployment of LLM-based pre-screening tools in healthcare.
Abstract
Large language models (LLMs) offer a promising pre-screening tool, improving early disease detection and providing enhanced healthcare access for underprivileged communities. The early diagnosis of various diseases continues to be a significant challenge in healthcare, primarily due to the nonspecific nature of early symptoms, the shortage of expert medical practitioners, and the need for prolonged clinical evaluations, all of which can delay treatment and adversely affect patient outcomes. With impressive accuracy in prediction across a range of diseases, LLMs have the potential to revolutionize clinical pre-screening and decision-making for various medical conditions. In this work, we study the diagnostic capability of LLMs for Rheumatoid Arthritis (RA) with real world patients data. Patient data was collected alongside diagnoses from medical experts, and the performance of LLMs was evaluated in comparison to expert diagnoses for RA disease prediction. We notice an interesting pattern in disease diagnosis and find an unexpected \textit{misalignment between prediction and explanation}. We conduct a series of multi-round analyses using different LLM agents. The best-performing model accurately predicts rheumatoid arthritis (RA) diseases approximately 95\% of the time. However, when medical experts evaluated the reasoning generated by the model, they found that nearly 68\% of the reasoning was incorrect. This study highlights a clear misalignment between LLMs high prediction accuracy and its flawed reasoning, raising important questions about relying on LLM explanations in clinical settings. \textbf{LLMs provide incorrect reasoning to arrive at the correct answer for RA disease diagnosis.}
