Toward Automatic Filling of Case Report Forms: A Case Study on Data from an Italian Emergency Department

Gabriela Anna Kaczmarek; Pietro Ferrazzi; Lorenzo Porta; Vicky Rubini; Bernardo Magnini

Toward Automatic Filling of Case Report Forms: A Case Study on Data from an Italian Emergency Department

Gabriela Anna Kaczmarek, Pietro Ferrazzi, Lorenzo Porta, Vicky Rubini, Bernardo Magnini

TL;DR

Results of the case-study show that CRF-filling from real clinical notes in Italian can be approached in a zero-shot setting and LLMs'results are affected by biases, which need to be corrected.

Abstract

Case Report Forms (CRFs) collect data about patients and are at the core of well-established practices to conduct research in clinical settings. With the recent progress of language technologies, there is an increasing interest in automatic CRF-filling from clinical notes, mostly based on the use of Large Language Models (LLMs). However, there is a general scarcity of annotated CRF data, both for training and testing LLMs, which limits the progress on this task. As a step in the direction of providing such data, we present a new dataset of clinical notes from an Italian Emergency Department annotated with respect to a pre-defined CRF containing 134 items to be filled. We provide an analysis of the data, define the CRF-filling task and metric for its evaluation, and report on pilot experiments where we use an open-source state-of-the-art LLM to automatically execute the task. Results of the case-study show that (i) CRF-filling from real clinical notes in Italian can be approached in a zero-shot setting; (ii) LLMs' results are affected by biases (e.g., a cautious behaviour favours "unknown" answers), which need to be corrected.

Toward Automatic Filling of Case Report Forms: A Case Study on Data from an Italian Emergency Department

TL;DR

Results of the case-study show that CRF-filling from real clinical notes in Italian can be approached in a zero-shot setting and LLMs'results are affected by biases, which need to be corrected.

Abstract

Paper Structure (17 sections, 1 figure, 3 tables)

This paper contains 17 sections, 1 figure, 3 tables.

Introduction
Data Analysis and Task Definition
Dataset
Clinical Notes.
Case Report Form.
Annotation Process.
Exploratory analysis outcomes.
CRF-filling: Task Definition
Evaluation Metrics
Experimental Setting
Prompting.
Model.
Hyperparameters.
Baseline.
Results and Discussion
...and 2 more sections

Figures (1)

Figure 1: Example of a clinical note, its English translation (provided only for reader comprehension), and the corresponding CRF items with assigned values. All other CRF fields default to unknown.

Toward Automatic Filling of Case Report Forms: A Case Study on Data from an Italian Emergency Department

TL;DR

Abstract

Toward Automatic Filling of Case Report Forms: A Case Study on Data from an Italian Emergency Department

Authors

TL;DR

Abstract

Table of Contents

Figures (1)