LLM with Relation Classifier for Document-Level Relation Extraction
Xingzuo Li, Kehai Chen, Yunfei Long, Min Zhang
TL;DR
This paper addresses the performance gap of large language models on document-level relation extraction by identifying NA-dominated candidate spaces as a key bottleneck. It proposes LMRC, a two-stage framework consisting of a Relation Candidate Proposal stage that uses a binary classifier with localized context pooling to filter relation-bearing entity pairs, followed by a Relation Classification stage that leverages LoRA-tuned LLMs to perform multi-class relation labeling on the reduced set. Empirical results on DocRED and Re-DocRED show LMRC substantially outperforms prior LLM-based approaches and narrows the gap with state-of-the-art BERT-based models, with ablations confirming the critical role of the pre-classifier and RC-tuning. Additional analyses on relation-frequency, out-of-domain alignment, and GPT-model integrations demonstrate the method's robustness and practicality for enhancing document-scale relation reasoning in real-world settings.
Abstract
Large language models (LLMs) have created a new paradigm for natural language processing. Despite their advancement, LLM-based methods still lag behind traditional approaches in document-level relation extraction (DocRE), a critical task for understanding complex entity relations within long context. This paper investigates the causes of this performance gap, identifying the dispersion of attention by LLMs due to entity pairs without relations as a key factor. We then introduce a novel classifier-LLM approach to DocRE. Particularly, the proposed approach begins with a classifier designed to select entity pair candidates that exhibit potential relations and then feed them to LLM for final relation classification. This method ensures that the LLM's attention is directed at relation-expressing entity pairs instead of those without relations during inference. Experiments on DocRE benchmarks reveal that our method significantly outperforms recent LLM-based DocRE models and narrows the performance gap with state-of-the-art BERT-based models.
