Leveraging Open Information Extraction for More Robust Domain Transfer of Event Trigger Detection
David Dukić, Kiril Gashteovski, Goran Glavaš, Jan Šnajder
TL;DR
This paper tackles negative domain transfer in event trigger detection by leveraging open information extraction (OIE) relations as cross-domain mediators. It introduces implicit and explicit multi-task architectures that couple TD with OIE relation extraction, and explores sequential and in-domain transfer regimes, augmented by a target-domain masked language modeling objective. Empirical results show that OIE mediation substantially improves zero- and few-shot TD transfer from Wikipedia to news domains, with even larger gains when MLM is incorporated, and the improvements prove robust to the choice of OIE system. The work suggests a practical path toward universal, domain-robust event extraction by bridging trigger semantics and predicate-argument structures learned through OIE.
Abstract
Event detection is a crucial information extraction task in many domains, such as Wikipedia or news. The task typically relies on trigger detection (TD) -- identifying token spans in the text that evoke specific events. While the notion of triggers should ideally be universal across domains, domain transfer for TD from high- to low-resource domains results in significant performance drops. We address the problem of negative transfer in TD by coupling triggers between domains using subject-object relations obtained from a rule-based open information extraction (OIE) system. We demonstrate that OIE relations injected through multi-task training can act as mediators between triggers in different domains, enhancing zero- and few-shot TD domain transfer and reducing performance drops, in particular when transferring from a high-resource source domain (Wikipedia) to a low(er)-resource target domain (news). Additionally, we combine this improved transfer with masked language modeling on the target domain, observing further TD transfer gains. Finally, we demonstrate that the gains are robust to the choice of the OIE system.
