An Autoregressive Text-to-Graph Framework for Joint Entity and Relation Extraction
Urchade Zaratiana, Nadi Tomeh, Pierre Holat, Thierry Charnois
TL;DR
This work reframes joint entity and relation extraction as conditional sequence generation that yields a linearized information-graph, using an autoregressive transformer with a dynamic, span-based vocabulary. A pointing mechanism grounds outputs in the source text, and constrained decoding guarantees well-formed graphs, while sentence augmentation mitigates premature termination of generation. ATG achieves strong results across ACE 05, CoNLL 2004, and SciERC, notably excelling on SciERC and matching or surpassing baselines on other benchmarks, with detailed analyses of attention and structure embeddings to illuminate the model's interpretability. The approach offers a practical, grounded, and controllable alternative to plain-text generative IE methods, with potential for improved grounding and efficiency in real-world knowledge-graph construction tasks.
Abstract
In this paper, we propose a novel method for joint entity and relation extraction from unstructured text by framing it as a conditional sequence generation problem. In contrast to conventional generative information extraction models that are left-to-right token-level generators, our approach is \textit{span-based}. It generates a linearized graph where nodes represent text spans and edges represent relation triplets. Our method employs a transformer encoder-decoder architecture with pointing mechanism on a dynamic vocabulary of spans and relation types. Our model can capture the structural characteristics and boundaries of entities and relations through span representations while simultaneously grounding the generated output in the original text thanks to the pointing mechanism. Evaluation on benchmark datasets validates the effectiveness of our approach, demonstrating competitive results. Code is available at https://github.com/urchade/ATG.
