LLMs Learn Task Heuristics from Demonstrations: A Heuristic-Driven Prompting Strategy for Document-Level Event Argument Extraction
Hanzhang Zhou, Junlang Qian, Zijian Feng, Hui Lu, Zixiao Zhu, Kezhi Mao
TL;DR
The paper tackles document-level event argument extraction (EAE) under limited labeled data by exploiting in-context learning (ICL) in large language models. It introduces HD-LoA prompting, which combines explicit task heuristics (heuristic-driven demonstration construction) with link-of-analogy prompting to handle many event types and unseen roles. Empirical results on RAMS and DocEE show consistent gains over standard and chain-of-thought prompting, with strong few-shot performance and cross-domain robustness, plus transferable gains to sentiment analysis and natural language inference. The work provides evidence that LLMs can learn task-specific heuristics from demonstrations and offers a practical prompting strategy to reduce data requirements while maintaining high accuracy.
Abstract
In this study, we investigate in-context learning (ICL) in document-level event argument extraction (EAE) to alleviate the dependency on large-scale labeled data for this task. We introduce the Heuristic-Driven Link-of-Analogy (HD-LoA) prompting to address the challenge of example selection and to develop a prompting strategy tailored for EAE. Specifically, we hypothesize and validate that LLMs learn task-specific heuristics from demonstrations via ICL. Building upon this hypothesis, we introduce an explicit heuristic-driven demonstration construction approach, which transforms the haphazard example selection process into a methodical method that emphasizes task heuristics. Additionally, inspired by the analogical reasoning of human, we propose the link-of-analogy prompting, which enables LLMs to process new situations by drawing analogies to known situations, enhancing their performance on unseen classes beyond limited ICL examples. Experiments show that our method outperforms existing prompting methods and few-shot supervised learning methods on document-level EAE datasets. Additionally, the HD-LoA prompting shows effectiveness in diverse tasks like sentiment analysis and natural language inference, demonstrating its broad adaptability.
