Leveraging Prompt-Learning for Structured Information Extraction from Crohn's Disease Radiology Reports in a Low-Resource Language
Liam Hazan, Gili Focht, Naama Gavrielov, Roi Reichart, Talar Hagopian, Mary-Louise C. Greer, Ruth Cytter Kuint, Dan Turner, Moti Freiman
TL;DR
The paper tackles the problem of extracting structured information from Hebrew Crohn's disease radiology reports, where data imbalance and language resources hinder traditional NLP. It introduces SMP-BERT, a prompt-learning model that pre-trains on a Section Matching Prediction task to connect the Findings and Impression sections, enabling zero-shot inference and data-efficient fine-tuning. In experiments with ~9.7k Hebrew reports, SMP-BERT + tuning achieves a median AUC of 0.99 and median F1 of 0.84, vastly outperforming standard fine-tuning and zero-shot variants, especially for rare phenotypes. The work demonstrates that prompt-learning approaches can deliver high-accuracy information extraction in low-resource languages, with meaningful implications for scalable AI-assisted Crohn's disease diagnostics and broader healthcare NLP.
Abstract
Automatic conversion of free-text radiology reports into structured data using Natural Language Processing (NLP) techniques is crucial for analyzing diseases on a large scale. While effective for tasks in widely spoken languages like English, generative large language models (LLMs) typically underperform with less common languages and can pose potential risks to patient privacy. Fine-tuning local NLP models is hindered by the skewed nature of real-world medical datasets, where rare findings represent a significant data imbalance. We introduce SMP-BERT, a novel prompt learning method that leverages the structured nature of reports to overcome these challenges. In our studies involving a substantial collection of Crohn's disease radiology reports in Hebrew (over 8,000 patients and 10,000 reports), SMP-BERT greatly surpassed traditional fine-tuning methods in performance, notably in detecting infrequent conditions (AUC: 0.99 vs 0.94, F1: 0.84 vs 0.34). SMP-BERT empowers more accurate AI diagnostics available for low-resource languages.
