Table of Contents
Fetching ...

A Cooperative Multi-Agent Framework for Zero-Shot Named Entity Recognition

Zihan Wang, Ziqi Zhao, Yougang Lyu, Zhumin Chen, Maarten de Rijke, Zhaochun Ren

TL;DR

This paper tackles zero-shot NER under no labeled data by addressing two core challenges: contextual correlations around entities and the unreliable use of demonstrations. It introduces CMAS, a cooperative four-agent framework consisting of a self-annotator, a type-related feature (TRF) extractor, a demonstration discriminator with a self-reflection mechanism, and an overall predictor, enabling two-subtask modeling of entities and TRFs and a selective, context-aware use of demonstrations. Through extensive zero-shot and few-shot experiments on six benchmarks (general and domain-specific), CMAS achieves state-of-the-art results and demonstrates robustness across LLM backbones and varying demonstration counts. Ablation and error analyses show the crucial roles of TRF extraction and demonstration discrimination, with potential gains from external syntactic tools, highlighting the practical impact of cooperative multi-agent prompting for information extraction tasks with limited supervision.

Abstract

Zero-shot named entity recognition (NER) aims to develop entity recognition systems from unannotated text corpora. This task presents substantial challenges due to minimal human intervention. Recent work has adapted large language models (LLMs) for zero-shot NER by crafting specialized prompt templates. It advances model self-learning abilities by incorporating self-annotated demonstrations. However, two important challenges persist: (i) Correlations between contexts surrounding entities are overlooked, leading to wrong type predictions or entity omissions. (ii) The indiscriminate use of task demonstrations, retrieved through shallow similarity-based strategies, severely misleads LLMs during inference. In this paper, we introduce the cooperative multi-agent system (CMAS), a novel framework for zero-shot NER that uses the collective intelligence of multiple agents to address the challenges outlined above. CMAS has four main agents: (i) a self-annotator, (ii) a type-related feature (TRF) extractor, (iii) a demonstration discriminator, and (iv) an overall predictor. To explicitly capture correlations between contexts surrounding entities, CMAS reformulates NER into two subtasks: recognizing named entities and identifying entity type-related features within the target sentence. To enable controllable utilization of demonstrations, a demonstration discriminator is established to incorporate the self-reflection mechanism, automatically evaluating helpfulness scores for the target sentence. Experimental results show that CMAS significantly improves zero-shot NER performance across six benchmarks, including both domain-specific and general-domain scenarios. Furthermore, CMAS demonstrates its effectiveness in few-shot settings and with various LLM backbones.

A Cooperative Multi-Agent Framework for Zero-Shot Named Entity Recognition

TL;DR

This paper tackles zero-shot NER under no labeled data by addressing two core challenges: contextual correlations around entities and the unreliable use of demonstrations. It introduces CMAS, a cooperative four-agent framework consisting of a self-annotator, a type-related feature (TRF) extractor, a demonstration discriminator with a self-reflection mechanism, and an overall predictor, enabling two-subtask modeling of entities and TRFs and a selective, context-aware use of demonstrations. Through extensive zero-shot and few-shot experiments on six benchmarks (general and domain-specific), CMAS achieves state-of-the-art results and demonstrates robustness across LLM backbones and varying demonstration counts. Ablation and error analyses show the crucial roles of TRF extraction and demonstration discrimination, with potential gains from external syntactic tools, highlighting the practical impact of cooperative multi-agent prompting for information extraction tasks with limited supervision.

Abstract

Zero-shot named entity recognition (NER) aims to develop entity recognition systems from unannotated text corpora. This task presents substantial challenges due to minimal human intervention. Recent work has adapted large language models (LLMs) for zero-shot NER by crafting specialized prompt templates. It advances model self-learning abilities by incorporating self-annotated demonstrations. However, two important challenges persist: (i) Correlations between contexts surrounding entities are overlooked, leading to wrong type predictions or entity omissions. (ii) The indiscriminate use of task demonstrations, retrieved through shallow similarity-based strategies, severely misleads LLMs during inference. In this paper, we introduce the cooperative multi-agent system (CMAS), a novel framework for zero-shot NER that uses the collective intelligence of multiple agents to address the challenges outlined above. CMAS has four main agents: (i) a self-annotator, (ii) a type-related feature (TRF) extractor, (iii) a demonstration discriminator, and (iv) an overall predictor. To explicitly capture correlations between contexts surrounding entities, CMAS reformulates NER into two subtasks: recognizing named entities and identifying entity type-related features within the target sentence. To enable controllable utilization of demonstrations, a demonstration discriminator is established to incorporate the self-reflection mechanism, automatically evaluating helpfulness scores for the target sentence. Experimental results show that CMAS significantly improves zero-shot NER performance across six benchmarks, including both domain-specific and general-domain scenarios. Furthermore, CMAS demonstrates its effectiveness in few-shot settings and with various LLM backbones.

Paper Structure

This paper contains 27 sections, 5 equations, 9 figures, 5 tables.

Figures (9)

  • Figure 1: (a) Examples of incorrect type prediction and entity omission from an existing method DBLP:conf/emnlp/XieLZZLW23 due to overlooking correlated contexts surrounding entities. Red texts represent wrongly recognized entities; golden labels are included in the brackets. Blue texts highlight entity TRF, i.e., contexts strongly associated with the entity types. (b) Proportions of selected demonstrations lacking target entity types in the WikiGold DBLP:conf/acl-pwnlp/BalasuriyaRNMC09, WNUT-17 DBLP:conf/aclnut/DerczynskiNEL17, OntoNotes, and BioNLP11 DBLP:journals/bmcbi/PyysaloORSMWSTA12 datasets. More than 40% of demonstrations do not contain any entity types within the target sentence.
  • Figure 2: An overview of CMAS. The dotted red lines indicate the workflow of an existing method DBLP:journals/corr/abs-2311-08921, which leads to incorrect predictions. In contrast, the solid black lines illustrate the workflow of the proposed CMAS, which consists of four key agents: (i) a self-annotator, (ii) a type-related feature extractor, (iii) a demonstration discriminator, and (iv) an overall predictor.
  • Figure 3: Few-shot NER results (F1) on WikiGold, WNUT-17, and GENIA.
  • Figure 4: Influence of task demonstration amount (F1) on WNUT-17 and GENIA.
  • Figure 5: Percentages of different error types on GENIA using CMAS.
  • ...and 4 more figures