A Multi-Agent LLM Framework for Multi-Domain Low-Resource In-Context NER via Knowledge Retrieval, Disambiguation and Reflective Analysis
Wenxuan Mu, Jinzhong Ning, Di Zhao, Yijia Zhang
TL;DR
KDR-Agent addresses multi-domain, low-resource NER by integrating external knowledge retrieval from Wikipedia, domain-specific disambiguation, and reflective analysis within a two-stage, multi-agent LLM framework. It replaces heavy Retrieval-based demonstrations with concise natural-language type definitions and static contrastive demonstrations, while adding planning, knowledge retrieval, and disambiguation agents to compensate for domain gaps. The Reflective Analysis stage then performs structured self-evaluation to guide a final corrective pass, yielding robust improvements across ten datasets and multiple backbones. Empirical results show consistent gains over zero-shot and few-shot baselines, with the strongest improvements in specialized domains such as Biomedical and Social Media, highlighting the framework’s generalizability and practical impact in low-resource settings.
Abstract
In-context learning (ICL) with large language models (LLMs) has emerged as a promising paradigm for named entity recognition (NER) in low-resource scenarios. However, existing ICL-based NER methods suffer from three key limitations: (1) reliance on dynamic retrieval of annotated examples, which is problematic when annotated data is scarce; (2) limited generalization to unseen domains due to the LLM's insufficient internal domain knowledge; and (3) failure to incorporate external knowledge or resolve entity ambiguities. To address these challenges, we propose KDR-Agent, a novel multi-agent framework for multi-domain low-resource in-context NER that integrates Knowledge retrieval, Disambiguation, and Reflective analysis. KDR-Agent leverages natural-language type definitions and a static set of entity-level contrastive demonstrations to reduce dependency on large annotated corpora. A central planner coordinates specialized agents to (i) retrieve factual knowledge from Wikipedia for domain-specific mentions, (ii) resolve ambiguous entities via contextualized reasoning, and (iii) reflect on and correct model predictions through structured self-assessment. Experiments across ten datasets from five domains demonstrate that KDR-Agent significantly outperforms existing zero-shot and few-shot ICL baselines across multiple LLM backbones. The code and data can be found at https://github.com/MWXGOD/KDR-Agent.
