Prompt-Guided Generation of Structured Chest X-Ray Report Using a Pre-trained LLM
Hongzhao Li, Hongyu Wang, Xia Sun, Hua He, Jun Feng
TL;DR
The paper tackles unstructured and non-interactive chest X-ray report generation by introducing a prompt-guided framework that leverages anatomy-region detection to create region-specific sentences, then guides a pre-trained LLM with anatomy and clinical context prompts to produce structured reports. Key contributions include an anatomy-focused detection and sentence-generation pipeline, a binary-prompt system for region presence and abnormalities, and GPT-4 integration with clinician-provided context to yield clinically accurate, interpretable reports. Evaluations on MIMIC-CXR with standard NLG metrics and a clinical-efficiency measure show superior language quality and improved clinical alignment compared to prior methods, along with strong region-level localization. The approach enhances interpretability and interactivity, enabling physician input to tailor reports to patient context and examination rationale, potentially reducing workload and diagnostic error in radiology.
Abstract
Medical report generation automates radiology descriptions from images, easing the burden on physicians and minimizing errors. However, current methods lack structured outputs and physician interactivity for clear, clinically relevant reports. Our method introduces a prompt-guided approach to generate structured chest X-ray reports using a pre-trained large language model (LLM). First, we identify anatomical regions in chest X-rays to generate focused sentences that center on key visual elements, thereby establishing a structured report foundation with anatomy-based sentences. We also convert the detected anatomy into textual prompts conveying anatomical comprehension to the LLM. Additionally, the clinical context prompts guide the LLM to emphasize interactivity and clinical requirements. By integrating anatomy-focused sentences and anatomy/clinical prompts, the pre-trained LLM can generate structured chest X-ray reports tailored to prompted anatomical regions and clinical contexts. We evaluate using language generation and clinical effectiveness metrics, demonstrating strong performance.
