GraphER: A Structure-aware Text-to-Graph Model for Entity and Relation Extraction
Urchade Zaratiana, Nadi Tomeh, Niama El Khbir, Pierre Holat, Thierry Charnois
TL;DR
GraphER reframes joint entity and relation extraction as Graph Structure Learning, starting from an initial, imperfect text-derived graph and iteratively refining it through a Graph Token Transformer. It uses precise node/edge selection to form an initial scaffold, followed by a single-shot graph editing stage that yields a final, structure-aware IE graph, with nodes labeled for entities and edges labeled for relations. Training optimizes a multitask loss that combines node/edge selection, editing, and final classification losses, achieving strong results on ACE 05, CoNLL-2004, and SciERC, often outperforming MPGNN baselines. The approach demonstrates that explicit graph structure learning and global attention over graph tokens can better handle noisy, heterogeneous graphs, enabling robust joint IE with practical implications for knowledge graph construction and NLP pipelines.
Abstract
Information extraction (IE) is an important task in Natural Language Processing (NLP), involving the extraction of named entities and their relationships from unstructured text. In this paper, we propose a novel approach to this task by formulating it as graph structure learning (GSL). By formulating IE as GSL, we enhance the model's ability to dynamically refine and optimize the graph structure during the extraction process. This formulation allows for better interaction and structure-informed decisions for entity and relation prediction, in contrast to previous models that have separate or untied predictions for these tasks. When compared against state-of-the-art baselines on joint entity and relation extraction benchmarks, our model, GraphER, achieves competitive results.
