Enhancing Document-level Argument Extraction with Definition-augmented Heuristic-driven Prompting for LLMs
Tongyue Sun, Jiayi Xiao
TL;DR
The paper tackles document-level Event Argument Extraction (EAE), a task hampered by data scarcity and complex cross-sentential reasoning. It introduces Definition-augmented Heuristic-driven Prompting (DHP), which integrates explicit event definitions, extraction rules, and heuristic-driven Chain-of-Thought with optimized prompt length to guide LLMs. Empirical results on RAMS and DocEE show that DHP improves Arg-I and Arg-C F1 scores over CoT prompting and several supervised baselines, with notable gains in cross-domain settings, indicating better generalization and reduced labeling needs. The method advances practical document-level EAE by combining structured definitions, rule-based heuristics, and reasoning prompts to improve accuracy and interpretability in LLM-driven extraction.
Abstract
Event Argument Extraction (EAE) is pivotal for extracting structured information from unstructured text, yet it remains challenging due to the complexity of real-world document-level EAE. We propose a novel Definition-augmented Heuristic-driven Prompting (DHP) method to enhance the performance of Large Language Models (LLMs) in document-level EAE. Our method integrates argument extraction-related definitions and heuristic rules to guide the extraction process, reducing error propagation and improving task accuracy. We also employ the Chain-of-Thought (CoT) method to simulate human reasoning, breaking down complex problems into manageable sub-problems. Experiments have shown that our method achieves a certain improvement in performance over existing prompting methods and few-shot supervised learning on document-level EAE datasets. The DHP method enhances the generalization capability of LLMs and reduces reliance on large annotated datasets, offering a novel research perspective for document-level EAE.
