MedAutoCorrect: Image-Conditioned Autocorrection in Medical Reporting
Arnold Caleb Asiimwe, Dídac Surís, Pranav Rajpurkar, Carl Vondrick
TL;DR
This work tackles the critical problem of inaccuracies in radiology reports by introducing image-conditioned autocorrection. It develops a two-stage DETECT+CORRECT framework: a token-level error detector leveraging a Vision Transformer and a medical-text encoder, followed by a GPT-2-based error-corrector that uses image features to produce clinically accurate replacements. Error data are generated via synthetic injection across six error categories, enabling robust training despite domain shifts from general Internet data. The approach yields improvements in both token-level detection and retrieval-based report generation, demonstrating its potential as a guardrail to enhance the reliability and safety of automated medical reporting. The framework highlights a practical path toward safer AI-assisted radiology, while acknowledging dataset limitations and the need for careful deployment in clinical contexts.
Abstract
In medical reporting, the accuracy of radiological reports, whether generated by humans or machine learning algorithms, is critical. We tackle a new task in this paper: image-conditioned autocorrection of inaccuracies within these reports. Using the MIMIC-CXR dataset, we first intentionally introduce a diverse range of errors into reports. Subsequently, we propose a two-stage framework capable of pinpointing these errors and then making corrections, simulating an \textit{autocorrection} process. This method aims to address the shortcomings of existing automated medical reporting systems, like factual errors and incorrect conclusions, enhancing report reliability in vital healthcare applications. Importantly, our approach could serve as a guardrail, ensuring the accuracy and trustworthiness of automated report generation. Experiments on established datasets and state of the art report generation models validate this method's potential in correcting medical reporting errors.
