Effective QA-driven Annotation of Predicate-Argument Relations Across Languages

Jonathan Davidov; Aviv Slobodkin; Shmuel Tomi Klein; Reut Tsarfaty; Ido Dagan; Ayal Klein

Effective QA-driven Annotation of Predicate-Argument Relations Across Languages

Jonathan Davidov, Aviv Slobodkin, Shmuel Tomi Klein, Reut Tsarfaty, Ido Dagan, Ayal Klein

TL;DR

This work introduces a cross-linguistic projection approach that reuses an English QA-SRL parser within a constrained translation and word-alignment pipeline to automatically generate question-answer annotations aligned with target-language predicates.

Abstract

Explicit representations of predicate-argument relations form the basis of interpretable semantic analysis, supporting reasoning, generation, and evaluation. However, attaining such semantic structures requires costly annotation efforts and has remained largely confined to English. We leverage the Question-Answer driven Semantic Role Labeling (QA-SRL) framework -- a natural-language formulation of predicate-argument relations -- as the foundation for extending semantic annotation to new languages. To this end, we introduce a cross-linguistic projection approach that reuses an English QA-SRL parser within a constrained translation and word-alignment pipeline to automatically generate question-answer annotations aligned with target-language predicates. Applied to Hebrew, Russian, and French -- spanning diverse language families -- the method yields high-quality training data and fine-tuned, language-specific parsers that outperform strong multilingual LLM baselines (GPT-4o, LLaMA-Maverick). By leveraging QA-SRL as a transferable natural-language interface for semantics, our approach enables efficient and broadly accessible predicate-argument parsing across languages.

Effective QA-driven Annotation of Predicate-Argument Relations Across Languages

TL;DR

Abstract

Paper Structure (60 sections, 3 figures, 5 tables)

This paper contains 60 sections, 3 figures, 5 tables.

Introduction
Background and Related Work
Multilingual Semantic Resources
QA-based Semantic Role Labeling
Multilingual QA-SRL Projection Algorithm
Overview
Goal & Motivation
Target Language Requirements
Projection Pipeline Outline
Word Alignment
Predicate Preserving Constrained Translation
Dataset Creation
Languages and Corpora
Evaluation set annotation
Quality Assessment of Data
...and 45 more sections

Figures (3)

Figure 1: A schematic overview of our proposed methodology for transferring QA-SRL to new languages based on the English QA-SRL infrastructure.
Figure 2: F1 score as a function of the IOU threshold
Figure 3: ROC curve for span matching decisions

Effective QA-driven Annotation of Predicate-Argument Relations Across Languages

TL;DR

Abstract

Effective QA-driven Annotation of Predicate-Argument Relations Across Languages

Authors

TL;DR

Abstract

Table of Contents

Figures (3)