Towards Better Question Generation in QA-based Event Extraction
Zijin Hong, Jian Liu
TL;DR
This work tackles QA-based event extraction by addressing the critical role of question quality. It defines four criteria for high-quality questions and develops RLQG, a reinforcement learning framework that combines inverse prompting and QA rewards to produce context-dependent, fluent, generalizable, and indicative questions. The method integrates a supervised QG module with beam search augmentation and uses PPO-based refinement to optimize question quality, evaluated on ACE 2005 and RAMS where it outperforms strong baselines, especially in data-scarce scenarios. An off-the-shelf QA model then answers the generated questions to complete the EE task, with results showing notable gains and robustness across architectures. The work advances QA-based EE by enabling high-quality, adaptable question generation with practical impact for information extraction in low-resource settings, and provides code for replication and further research.
Abstract
Event Extraction (EE) is an essential information extraction task that aims to extract event-related information from unstructured texts. The paradigm of this task has shifted from conventional classification-based methods to more contemporary question-answering-based (QA-based) approaches. However, in QA-based EE, the quality of the questions dramatically affects the extraction accuracy, and how to generate high-quality questions for QA-based EE remains a challenge. In this work, to tackle this challenge, we suggest four criteria to evaluate the quality of a question and propose a reinforcement learning method, RLQG, for QA-based EE that can generate generalizable, high-quality, and context-dependent questions and provides clear guidance to QA models. The extensive experiments conducted on ACE and RAMS datasets have strongly validated our approach's effectiveness, which also demonstrates its robustness in scenarios with limited training data. The corresponding code of RLQG is released for further research.
