Enhanced Multi-Tuple Extraction for Alloys: Integrating Pointer Networks and Augmented Attention
Mengzhe Hei, Zhouran Zhang, Qingbao Liu, Yan Pan, Xiang Zhao, Yongqian Peng, Yicong Ye, Xin Zhang, Shuxin Bai
TL;DR
This work tackles the scarcity of robust multi-tuple extraction from materials science literature by proposing a two-stage approach that first uses MatSciBERT with a pointer network for precise entity extraction and then applies an inter- and intra-entity attention-based allocation model to assemble complete tuples. The framework operates on a curated corpus of 255 sentences with 568 tuples across five entity types, achieving high $F1$ scores for 1–4 tuples ($0.963$, $0.947$, $0.848$, $0.753$) and $0.854$ on a random set, while outperforming four large language models in a prompt-based setting. Ablation studies confirm the critical roles of allocation and both attention mechanisms, and discussions compare the approach to traditional NER-RE pipelines, highlighting reduced hallucination risk and greater precision for structured scientific data. Overall, the method provides a scalable, domain-specific alternative to LLMs for extracting precise, structured material properties, enabling more reliable data for data-driven materials design.
Abstract
Extracting high-quality structured information from scientific literature is crucial for advancing material design through data-driven methods. Despite the considerable research in natural language processing for dataset extraction, effective approaches for multi-tuple extraction in scientific literature remain scarce due to the complex interrelations of tuples and contextual ambiguities. In the study, we illustrate the multi-tuple extraction of mechanical properties from multi-principal-element alloys and presents a novel framework that combines an entity extraction model based on MatSciBERT with pointer networks and an allocation model utilizing inter- and intra-entity attention. Our rigorous experiments on tuple extraction demonstrate impressive F1 scores of 0.963, 0.947, 0.848, and 0.753 across datasets with 1, 2, 3, and 4 tuples, confirming the effectiveness of the model. Furthermore, an F1 score of 0.854 was achieved on a randomly curated dataset. These results highlight the model's capacity to deliver precise and structured information, offering a robust alternative to large language models and equipping researchers with essential data for fostering data-driven innovations.
