Small Models Are (Still) Effective Cross-Domain Argument Extractors

William Gantt; Aaron Steven White

Small Models Are (Still) Effective Cross-Domain Argument Extractors

William Gantt, Aaron Steven White

TL;DR

The paper addresses cross-domain transfer in event argument extraction (EAE) by recasting into QA and TI. It demonstrates that small Flan-T5 models trained on a suitable source ontology can achieve zero-shot $F_1$ performance competitive with or surpassing GPT-3.5 and GPT-4 across six datasets. It provides expert-written questions and templates for all six ontologies and analyzes transfer dynamics, including in-domain vs. out-of-domain performance and paraphrase augmentation. The results argue for using compact, ontology-tuned models as a practical first line for cross-domain EAE and highlight continued challenges for highly distant ontology transfer.

Abstract

Effective ontology transfer has been a major goal of recent work on event argument extraction (EAE). Two methods in particular -- question answering (QA) and template infilling (TI) -- have emerged as promising approaches to this problem. However, detailed explorations of these techniques' ability to actually enable this transfer are lacking. In this work, we provide such a study, exploring zero-shot transfer using both techniques on six major EAE datasets at both the sentence and document levels. Further, we challenge the growing reliance on LLMs for zero-shot extraction, showing that vastly smaller models trained on an appropriate source ontology can yield zero-shot performance superior to that of GPT-3.5 or GPT-4.

Small Models Are (Still) Effective Cross-Domain Argument Extractors

TL;DR

performance competitive with or surpassing GPT-3.5 and GPT-4 across six datasets. It provides expert-written questions and templates for all six ontologies and analyzes transfer dynamics, including in-domain vs. out-of-domain performance and paraphrase augmentation. The results argue for using compact, ontology-tuned models as a practical first line for cross-domain EAE and highlight continued challenges for highly distant ontology transfer.

Abstract

Paper Structure (34 sections, 3 figures, 2 tables)

This paper contains 34 sections, 3 figures, 2 tables.

Introduction
Approaches
Question Answering (QA)
Template Infilling (TI)
Experiments
Data
ACE doddington-etal-2004-automatic
ERE Light and Rich song-etal-2015-light
FAMuS vashishtha-etal-2023-famus
RAMS ebner-etal-2020-multi
WikiEvents li-etal-2021-document
Models and Evaluation
QA
TI
OpenAI Models
...and 19 more sections

Figures (3)

Figure 1: Example event types from WikiEvents (left) and from FAMuS (right), along with the templates and questions used for them in this work.
Figure 2: Pearson's $\rho$ between per-event type in-domain and zero-shot argument $\text{F}_1$.
Figure 3: Argument $\text{F}_1$ of QA and TI models trained on FAMuS with and without an additional five paraphrases per question/template.

Small Models Are (Still) Effective Cross-Domain Argument Extractors

TL;DR

Abstract

Small Models Are (Still) Effective Cross-Domain Argument Extractors

Authors

TL;DR

Abstract

Table of Contents

Figures (3)