Automated Spinal MRI Labelling from Reports Using a Large Language Model

Robin Y. Park; Rhydian Windsor; Amir Jamaludin; Andrew Zisserman

TL;DR

We propose a general pipeline that uses large language models to automate the extraction of labels from radiology reports. We validate it on spinal MRI reports and show that the extracted labels can be used to train imaging models to classify the identified conditions in the accompanying MR scans.

Abstract

We propose a general pipeline to automate the extraction of labels from radiology reports using large language models, which we validate on spinal MRI reports. The efficacy of our labelling method is measured on five distinct conditions: spinal cancer, stenosis, spondylolisthesis, cauda equina compression and herniation. Using open-source models, our method equals or surpasses GPT-4 on a held-out set of reports. Furthermore, we show that the extracted labels can be used to train imaging models to classify the identified conditions in the accompanying MR scans. All classifiers trained using automated labels achieve comparable performance to models trained using scans manually annotated by clinicians. Code can be found at https://github.com/robinyjpark/AutoLabelClassifier.
