Speaker attribution in German parliamentary debates with QLoRA-adapted large language models

Tobias Bornheim; Niklas Grieger; Patrick Gustav Blaneck; Stephan Bialonski

Speaker attribution in German parliamentary debates with QLoRA-adapted large language models

Tobias Bornheim, Niklas Grieger, Patrick Gustav Blaneck, Stephan Bialonski

TL;DR

The potential of the large language model family Llama 2 to automate speaker attribution in German parliamentary debates from 2017-2021 is studied and a promising avenue for computational analysis of political discourse and the development of semantic role labeling systems is revealed.

Abstract

The growing body of political texts opens up new opportunities for rich insights into political dynamics and ideologies but also increases the workload for manual analysis. Automated speaker attribution, which detects who said what to whom in a speech event and is closely related to semantic role labeling, is an important processing step for computational text analysis. We study the potential of the large language model family Llama 2 to automate speaker attribution in German parliamentary debates from 2017-2021. We fine-tune Llama 2 with QLoRA, an efficient training strategy, and observe our approach to achieve competitive performance in the GermEval 2023 Shared Task On Speaker Attribution in German News Articles and Parliamentary Debates. Our results shed light on the capabilities of large language models in automating speaker attribution, revealing a promising avenue for computational analysis of political discourse and the development of semantic role labeling systems.

Speaker attribution in German parliamentary debates with QLoRA-adapted large language models

TL;DR

Abstract

Paper Structure (16 sections, 3 figures, 3 tables)

This paper contains 16 sections, 3 figures, 3 tables.

Introduction
Data and tasks
Methods
Models
Preprocessing
Training
Inference
Postprocessing and evaluation metrics
Enforcing the output format.
Preventing overlapping cues.
Ignoring made-up words.
Resolving ambiguities.
Including surrounded punctuation.
Evaluating metrics.
Results
...and 1 more sections

Figures (3)

Figure 1: Sentence from the Train dataset with three annotations. The sentence was split into three samples by spaCy (splitting points are indicated by ‡). This segmentation also occurs at not-punctuated positions, as seen in the example sentence ("… rassistischen Positionen ‡ haben die Rechtsradikalen …"). This behavior is due to the data provided by "Open Bundestag", where comments from other members of parliament during an otherwise coherent paragraph force this unintuitive segmentation into two separate paragraphs GermEval2023. As seen in Annotation 2, there can be annotations consisting of only cue word(s). Annotation 1 and Annotation 3 show that annotated roles can span multiple samples.
Figure 2: Example cue prompt and desired model response for the sample "denn wir wissen: Neben ihren rassistischen Positionen" with the cues "wissen" and "Positionen". Shaded in gray are the parts of the prompt and response that are sample dependent. The prompt is used as the Input sequence for training and inference, while the Output sequence contains the desired response with the cues. The end-of-sentence token "</s>" is used to indicate the end of the Output sequence.
Figure 3: Example role prompt and desired model response for the sample "denn wir wissen: Neben ihren rassistischen Positionen" with the cue "wissen". Since roles can be contained in samples different from the one containing the cue, we concatenated the sample with the next two samples of the same speech (transitions between samples are indicated by ‡). Shaded in gray are the parts of the prompt and response that are sample dependent. Similar to the cue prompt, the role prompt is used as the Input sequence for training and inference, while the Output sequence contains the desired response. We append the end-of-sentence token "</s>" to the Output.

Speaker attribution in German parliamentary debates with QLoRA-adapted large language models

TL;DR

Abstract

Speaker attribution in German parliamentary debates with QLoRA-adapted large language models

Authors

TL;DR

Abstract

Table of Contents

Figures (3)