Table of Contents
Fetching ...

RuOpinionNE-2024: Extraction of Opinion Tuples from Russian News Texts

Natalia Loukachevitch, Natalia Tkachenko, Anna Lapanitsyna, Mikhail Tikhomirov, Nicolay Rusnachenko

TL;DR

This work presents RuOpinionNE-2024, a shared task and dataset for extracting structured opinion tuples (holder, target, polarity, expression) from Russian news sentences. It builds on the RuSentNE corpus and defines a strict evaluation protocol (F1 with token-overlap criteria) over JSON Lines data, reporting results from over 100 submissions that utilized zero-shot, few-shot, and fine-tuned large language models. The top performance (F1 ≈ 0.41) comes from parameter-efficient fine-tuning of a very large model, with extensive exploration of prompts (30 prompts) and 11 models in 1-shot and 10-shot settings. The work provides a publicly available dataset, a rigorous evaluation framework, and practical insights into prompt design and model scaling for structured opinion extraction in Russian, paving the way for more nuanced media analysis.

Abstract

In this paper, we introduce the Dialogue Evaluation shared task on extraction of structured opinions from Russian news texts. The task of the contest is to extract opinion tuples for a given sentence; the tuples are composed of a sentiment holder, its target, an expression and sentiment from the holder to the target. In total, the task received more than 100 submissions. The participants experimented mainly with large language models in zero-shot, few-shot and fine-tuning formats. The best result on the test set was obtained with fine-tuning of a large language model. We also compared 30 prompts and 11 open source language models with 3-32 billion parameters in the 1-shot and 10-shot settings and found the best models and prompts.

RuOpinionNE-2024: Extraction of Opinion Tuples from Russian News Texts

TL;DR

This work presents RuOpinionNE-2024, a shared task and dataset for extracting structured opinion tuples (holder, target, polarity, expression) from Russian news sentences. It builds on the RuSentNE corpus and defines a strict evaluation protocol (F1 with token-overlap criteria) over JSON Lines data, reporting results from over 100 submissions that utilized zero-shot, few-shot, and fine-tuned large language models. The top performance (F1 ≈ 0.41) comes from parameter-efficient fine-tuning of a very large model, with extensive exploration of prompts (30 prompts) and 11 models in 1-shot and 10-shot settings. The work provides a publicly available dataset, a rigorous evaluation framework, and practical insights into prompt design and model scaling for structured opinion extraction in Russian, paving the way for more nuanced media analysis.

Abstract

In this paper, we introduce the Dialogue Evaluation shared task on extraction of structured opinions from Russian news texts. The task of the contest is to extract opinion tuples for a given sentence; the tuples are composed of a sentiment holder, its target, an expression and sentiment from the holder to the target. In total, the task received more than 100 submissions. The participants experimented mainly with large language models in zero-shot, few-shot and fine-tuning formats. The best result on the test set was obtained with fine-tuning of a large language model. We also compared 30 prompts and 11 open source language models with 3-32 billion parameters in the 1-shot and 10-shot settings and found the best models and prompts.

Paper Structure

This paper contains 9 sections, 2 figures, 8 tables.

Figures (2)

  • Figure 1: Example of opinion extraction with explanations according to the RuOpinionNE-2024 task. Two opinion tuples are shown: (Italy, Bersany, beat Florence Mayor, positive) and (AUTHOR, Matteo Renzi, prominent, positive)
  • Figure 2: Example of opinion annotation for the RuOpinionNE-2024 task