Entity Decomposition with Filtering: A Zero-Shot Clinical Named Entity Recognition Framework
Reza Averly, Xia Ning
TL;DR
This work tackles zero-shot clinical NER by leveraging open NER LLMs through a novel EDF framework. EDF decomposes a target entity type into sub-types, uses an open NER LLM to retrieve sub-type entities, and applies context-aware filtering to produce accurate final extractions, enabling robust performance without training data. Across multiple public clinical datasets and entity types, EDF improves recall via decomposition and precision via filtering, with average F1 gains of $2.54\%$ (UniversalNER) and $5.82\%$ (GNER). The paper also presents thorough ablations and error analyses, showing the framework’s robustness to decomposer and filter choices and highlighting remaining challenges such as abbreviations and polarity-driven context effects. Overall, EDF offers a cost-efficient, zero-shot pathway for clinical NER with open models, guiding future expansion to additional entity types and datasets.
Abstract
Clinical named entity recognition (NER) aims to retrieve important entities within clinical narratives. Recent works have demonstrated that large language models (LLMs) can achieve strong performance in this task. While previous works focus on proprietary LLMs, we investigate how open NER LLMs, trained specifically for entity recognition, perform in clinical NER. Our initial experiment reveals significant contrast in performance for some clinical entities and how a simple exploitment on entity types can alleviate this issue. In this paper, we introduce a novel framework, entity decomposition with filtering, or EDF. Our key idea is to decompose the entity recognition task into several retrievals of entity sub-types and then filter them. Our experimental results demonstrate the efficacies of our framework and the improvements across all metrics, models, datasets, and entity types. Our analysis also reveals substantial improvement in recognizing previously missed entities using entity decomposition. We further provide a comprehensive evaluation of our framework and an in-depth error analysis to pave future works.
