Table of Contents
Fetching ...

One Small and One Large for Document-level Event Argument Extraction

Jiaren Peng, Hongda Sun, Wenzhong Yang, Fuyuan Wei, Liang He, Liejun Wang

TL;DR

The first method introduces the Co and Structure Event Argument Extraction model (CsEAE) based on Small Language Models (SLMs), addressing gaps in EAE performance using LLMs under Supervised Fine-Tuning (SFT) conditions and demonstrates that reliable insights validated on SLMs are also applicable to LLMs.

Abstract

Document-level Event Argument Extraction (EAE) faces two challenges due to increased input length: 1) difficulty in distinguishing semantic boundaries between events, and 2) interference from redundant information. To address these issues, we propose two methods. The first method introduces the Co and Structure Event Argument Extraction model (CsEAE) based on Small Language Models (SLMs). CsEAE includes a co-occurrences-aware module, which integrates information about all events present in the current input through context labeling and co-occurrences event prompts extraction. Additionally, CsEAE includes a structure-aware module that reduces interference from redundant information by establishing structural relationships between the sentence containing the trigger and other sentences in the document. The second method introduces new prompts to transform the extraction task into a generative task suitable for Large Language Models (LLMs), addressing gaps in EAE performance using LLMs under Supervised Fine-Tuning (SFT) conditions. We also fine-tuned multiple datasets to develop an LLM that performs better across most datasets. Finally, we applied insights from CsEAE to LLMs, achieving further performance improvements. This suggests that reliable insights validated on SLMs are also applicable to LLMs. We tested our models on the Rams, WikiEvents, and MLEE datasets. The CsEAE model achieved improvements of 2.1\%, 2.3\%, and 3.2\% in the Arg-C F1 metric compared to the baseline, PAIE~\cite{PAIE}. For LLMs, we demonstrated that their performance on document-level datasets is comparable to that of SLMs~\footnote{All code is available at https://github.com/simon-p-j-r/CsEAE}.

One Small and One Large for Document-level Event Argument Extraction

TL;DR

The first method introduces the Co and Structure Event Argument Extraction model (CsEAE) based on Small Language Models (SLMs), addressing gaps in EAE performance using LLMs under Supervised Fine-Tuning (SFT) conditions and demonstrates that reliable insights validated on SLMs are also applicable to LLMs.

Abstract

Document-level Event Argument Extraction (EAE) faces two challenges due to increased input length: 1) difficulty in distinguishing semantic boundaries between events, and 2) interference from redundant information. To address these issues, we propose two methods. The first method introduces the Co and Structure Event Argument Extraction model (CsEAE) based on Small Language Models (SLMs). CsEAE includes a co-occurrences-aware module, which integrates information about all events present in the current input through context labeling and co-occurrences event prompts extraction. Additionally, CsEAE includes a structure-aware module that reduces interference from redundant information by establishing structural relationships between the sentence containing the trigger and other sentences in the document. The second method introduces new prompts to transform the extraction task into a generative task suitable for Large Language Models (LLMs), addressing gaps in EAE performance using LLMs under Supervised Fine-Tuning (SFT) conditions. We also fine-tuned multiple datasets to develop an LLM that performs better across most datasets. Finally, we applied insights from CsEAE to LLMs, achieving further performance improvements. This suggests that reliable insights validated on SLMs are also applicable to LLMs. We tested our models on the Rams, WikiEvents, and MLEE datasets. The CsEAE model achieved improvements of 2.1\%, 2.3\%, and 3.2\% in the Arg-C F1 metric compared to the baseline, PAIE~\cite{PAIE}. For LLMs, we demonstrated that their performance on document-level datasets is comparable to that of SLMs~\footnote{All code is available at https://github.com/simon-p-j-r/CsEAE}.

Paper Structure

This paper contains 36 sections, 9 equations, 5 figures, 7 tables.

Figures (5)

  • Figure 1: An EAE instance from the WikiEvents dataset.
  • Figure 2: Overview of CsEAE. The yellow attention represents the concatenation of co-occurrences-aware module, while the blue attention represents the concatenation of structure-aware module.
  • Figure 3: Prompt for LLMs on WikiEvents. The blue parts represent $\mathcal{I}$, the yellow parts represent $\mathcal{E}$, the green parts represent $\mathcal{Q}$ and the red parts represent co-occurrences- and structure-aware interactions.
  • Figure 4: The performance of different EAE models in extracting arguments at different distances from the triggers. We only measure the Arg-C F1 metric.
  • Figure 5: Two test cases from Rams and WikiEvents.