Entity Retrieval for Answering Entity-Centric Questions
Hassan S. Shavarani, Anoop Sarkar
TL;DR
Entity-centric questions challenge standard retrieval by relying on question-document similarity. The authors propose Entity Retrieval, which uses salient question entities to fetch knowledge-base articles, truncates them to a fixed window, and uses them to augment LLM prompts. Through evaluations on FactoidQA and EntityQuestions, Entity Retrieval achieves higher retrieval quality and QA accuracy with fewer documents and better efficiency than BM25, DPR, and ANCE, with SpEL-based linking enabling operation without manual annotations. The work demonstrates practical gains for offline/embedded deployment and highlights the importance of robust entity linking and knowledge-base selection for real-world entity-centric QA.
Abstract
The similarity between the question and indexed documents is a crucial factor in document retrieval for retrieval-augmented question answering. Although this is typically the only method for obtaining the relevant documents, it is not the sole approach when dealing with entity-centric questions. In this study, we propose Entity Retrieval, a novel retrieval method which rather than relying on question-document similarity, depends on the salient entities within the question to identify the retrieval documents. We conduct an in-depth analysis of the performance of both dense and sparse retrieval methods in comparison to Entity Retrieval. Our findings reveal that our method not only leads to more accurate answers to entity-centric questions but also operates more efficiently.
