GLiNER: Generalist Model for Named Entity Recognition using Bidirectional Transformer
Urchade Zaratiana, Nadi Tomeh, Pierre Holat, Thierry Charnois
TL;DR
GLiNER tackles open-type NER with a compact bidirectional LM that matches entity-type embeddings to span representations, enabling parallel extraction and efficient inference. By training on the diverse Pile-NER corpus and using a span-based decoding strategy, GLiNER achieves competitive to superior zero-shot performance compared to ChatGPT and fine-tuned LLMs, across English and multilingual datasets, while maintaining much smaller compute requirements. Key contributions include a unified latent-space formulation for entity-type and span representations, a scalable O(n log n) decoding scheme, and extensive ablations showing robustness across backbones and data regimes. This approach offers a practical, resource-efficient alternative for open-type NER with strong cross-domain and cross-language generalization, suitable for deployment in compute-constrained settings.
Abstract
Named Entity Recognition (NER) is essential in various Natural Language Processing (NLP) applications. Traditional NER models are effective but limited to a set of predefined entity types. In contrast, Large Language Models (LLMs) can extract arbitrary entities through natural language instructions, offering greater flexibility. However, their size and cost, particularly for those accessed via APIs like ChatGPT, make them impractical in resource-limited scenarios. In this paper, we introduce a compact NER model trained to identify any type of entity. Leveraging a bidirectional transformer encoder, our model, GLiNER, facilitates parallel entity extraction, an advantage over the slow sequential token generation of LLMs. Through comprehensive testing, GLiNER demonstrate strong performance, outperforming both ChatGPT and fine-tuned LLMs in zero-shot evaluations on various NER benchmarks.
