RTF: Region-based Table Filling Method for Relational Triple Extraction
Ning An, Lei Hei, Yong Jiang, Weiping Meng, Jingjing Hu, Boran Huang, Feiliang Ren
TL;DR
This work presents Region-based Table Filling (RTF) for relational triple extraction, addressing the spatial gaps of token- and token-pair–level methods. By introducing the Entity Pair as Region (EPR) tagging scheme, region-level representations via a Convolution Block, relational residual learning, and a bi-directional decoding process, RTF effectively identifies entity-pair regions on relation-specific tables. Empirical results on NYT and WebNLG (including rearranged variants) demonstrate state-of-the-art performance and enhanced generalization, with notable recall improvements driven by regional correlations. The approach advances boundary recognition and overlap handling, offering a scalable method for robust RTE in knowledge-graph construction, albeit with higher memory and training-time demands for large-scale, multi-relation settings.
Abstract
Relational triple extraction is crucial work for the automatic construction of knowledge graphs. Existing methods only construct shallow representations from a token or token pair-level. However, previous works ignore local spatial dependencies of relational triples, resulting in a weakness of entity pair boundary detection. To tackle this problem, we propose a novel Region-based Table Filling method (RTF). We devise a novel region-based tagging scheme and bi-directional decoding strategy, which regard each relational triple as a region on the relation-specific table, and identifies triples by determining two endpoints of each region. We also introduce convolution to construct region-level table representations from a spatial perspective which makes triples easier to be captured. In addition, we share partial tagging scores among different relations to improve learning efficiency of relation classifier. Experimental results show that our method achieves state-of-the-art with better generalization capability on three variants of two widely used benchmark datasets.
