A Cascade Dual-Decoder Model for Joint Entity and Relation Extraction

Jian Cheng; Tian Zhang; Shuang Zhang; Huimin Ren; Guo Yu; Xiliang Zhang; Shangce Gao; Lianbo Ma

TL;DR

This work introduces a cascade dual-decoder for joint entity and relation extraction that treats relations as mapping functions and first detects text-level relations (TR) to guide entity pair extraction (HE and TE) in a cascade. By modeling $p_{\theta}(r|x)$, $p_{\theta}(h|r,x)$, and $p_{\theta}(t|r,h,x)$ within a KL-divergence framework, the method reduces error propagation and naturally handles overlapping triples. Empirical results on NYT, WebNLG, and a real open-pit mining dataset show state-of-the-art or competitive performance, with notable gains in exact-match metrics and relation-element extraction, especially in challenging overlapping scenarios. The approach demonstrates strong practical potential for complex information extraction tasks in knowledge graph construction and domain-specific corpora.
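One way to read the three conditional terms above is as a chain-rule factorization of the probability of a single triple $(h, r, t)$ given a sentence $x$. The identity below is the standard chain rule written in the relation-first order of the cascade; it is offered as an interpretation, not as a formula quoted from the paper, and the joint notation $p_{\theta}(h, r, t \mid x)$ is an assumption introduced here.

$$
p_{\theta}(h, r, t \mid x) = p_{\theta}(r \mid x)\; p_{\theta}(h \mid r, x)\; p_{\theta}(t \mid r, h, x)
$$

Under this reading, the relation is scored first from the whole sentence, the head entity is conditioned on that relation, and the tail entity is conditioned on both.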

Abstract

In knowledge graph construction, a challenging issue is how to extract complex (e.g., overlapping) entities and relationships from a small amount of unstructured historical data. Traditional pipeline methods divide the extraction into two separate subtasks, which misses the potential interaction between them and may lead to error propagation. In this work, we propose an effective cascade dual-decoder method to extract overlapping relational triples, which includes a text-specific relation decoder and a relation-corresponded entity decoder. Our approach is straightforward: the text-specific relation decoder detects relations at the text level, that is, according to the semantic information of the whole sentence. For each extracted relation, which is represented by a trainable embedding, the relation-corresponded entity decoder detects the corresponding head and tail entities using a span-based tagging scheme. In this way, the overlapping triple problem can be tackled naturally. We conducted experiments on a real-world open-pit mine dataset and two public datasets to verify the method's generalizability. The experimental results demonstrate the effectiveness and competitiveness of the proposed method, which achieves better F1 scores under strict evaluation metrics. Our implementation is available at https://github.com/prastunlp/DualDec.
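The cascade described in the abstract (relations first, then head and tail spans per relation) can be illustrated with a minimal decoding sketch. This is not the authors' implementation: the function names `decode_spans` and `cascade_decode`, the 0.5 threshold, the nearest-end pairing rule, the stub taggers, and the toy sentence and relation names are all assumptions made for illustration. In the actual model the scores would come from learned decoders conditioned on a relation embedding.

```python
from typing import Callable, Dict, List, Tuple

Span = Tuple[int, int]                      # inclusive token indices (start, end)
Scores = Tuple[List[float], List[float]]    # (start scores, end scores) per token
HeadTagger = Callable[[str], Scores]        # hypothetical: relation -> head scores
TailTagger = Callable[[str, Span], Scores]  # hypothetical: relation, head -> tail scores


def decode_spans(start_probs: List[float], end_probs: List[float],
                 threshold: float = 0.5) -> List[Span]:
    """Pair each start above the threshold with the nearest end at or after it."""
    spans = []
    for i, p_start in enumerate(start_probs):
        if p_start < threshold:
            continue
        for j in range(i, len(end_probs)):
            if end_probs[j] >= threshold:
                spans.append((i, j))
                break
    return spans


def cascade_decode(tokens: List[str], relation_probs: Dict[str, float],
                   head_tagger: HeadTagger, tail_tagger: TailTagger,
                   threshold: float = 0.5) -> List[Tuple[str, str, str]]:
    """Relations first, then heads per relation, then tails per (relation, head)."""
    triples = []
    for relation, p_rel in relation_probs.items():
        if p_rel < threshold:
            continue  # relation not detected at the text level
        h_start, h_end = head_tagger(relation)
        for hs, he in decode_spans(h_start, h_end, threshold):
            t_start, t_end = tail_tagger(relation, (hs, he))
            for ts, te in decode_spans(t_start, t_end, threshold):
                head = " ".join(tokens[hs:he + 1])
                tail = " ".join(tokens[ts:te + 1])
                triples.append((head, relation, tail))
    return triples


# Toy input with made-up scores; the stub taggers ignore their arguments.
tokens = ["Ann", "Lee", "lives", "and", "works", "in", "Oslo"]
relation_probs = {"lives_in": 0.9, "works_in": 0.8, "capital_of": 0.1}


def toy_head_tagger(relation: str) -> Scores:
    return [0.9, 0.1, 0.0, 0.0, 0.0, 0.0, 0.0], [0.1, 0.8, 0.0, 0.0, 0.0, 0.0, 0.0]


def toy_tail_tagger(relation: str, head: Span) -> Scores:
    return [0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.9], [0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.9]


triples = cascade_decode(tokens, relation_probs, toy_head_tagger, toy_tail_tagger)
print(triples)
```

Because entity spans are decoded separately for each detected relation, the same entity pair can appear under several relations (here both `lives_in` and `works_in`), which is one way to see why overlapping triples need no special handling in a relation-first cascade.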
