SemEval 2024 -- Task 10: Emotion Discovery and Reasoning its Flip in Conversation (EDiReF)

Shivani Kumar; Md Shad Akhtar; Erik Cambria; Tanmoy Chakraborty

SemEval 2024 -- Task 10: Emotion Discovery and Reasoning its Flip in Conversation (EDiReF)

Shivani Kumar, Md Shad Akhtar, Erik Cambria, Tanmoy Chakraborty

TL;DR

SemEval-2024 Task 10 (EDiReF) delivers a comprehensive study of Emotion Recognition in Conversation (ERC) and Emotion Flip Reasoning (EFR) in English and Hindi-English code-mixed dialogue. It introduces three subtasks across code-mixed and monolingual data, accompanied by two new bilingual corpora (MELD-FR and E-MaSaC/EFR-MaSaC) and explicit trigger-labeling guidelines. Across 84 participants and 24 system-descriptions, top results reached F1 scores of approximately 0.78–0.79 for ERC and EFR tasks, with LLMs leading in code-mixed settings and classical ML methods proving competitive in trigger identification. The work highlights practical implications for dialogue systems, such as translating code-mixed data, leveraging context for emotion flips, and developing interpretable triggers, while providing publicly available datasets to spur future research in multilingual ERC and EFR.

Abstract

We present SemEval-2024 Task 10, a shared task centred on identifying emotions and finding the rationale behind their flips within monolingual English and Hindi-English code-mixed dialogues. This task comprises three distinct subtasks - emotion recognition in conversation for code-mixed dialogues, emotion flip reasoning for code-mixed dialogues, and emotion flip reasoning for English dialogues. Participating systems were tasked to automatically execute one or more of these subtasks. The datasets for these tasks comprise manually annotated conversations focusing on emotions and triggers for emotion shifts (The task data is available at https://github.com/LCS2-IIITD/EDiReF-SemEval2024.git). A total of 84 participants engaged in this task, with the most adept systems attaining F1-scores of 0.70, 0.79, and 0.76 for the respective subtasks. This paper summarises the results and findings from 24 teams alongside their system descriptions.

SemEval 2024 -- Task 10: Emotion Discovery and Reasoning its Flip in Conversation (EDiReF)

TL;DR

Abstract

Paper Structure (22 sections, 3 figures, 7 tables)

This paper contains 22 sections, 3 figures, 7 tables.

Introduction
Related Work
Emotion recognition.
Emotion and code-mixing.
Beyond emotion recognition.
Data
English Conversations:
Hindi-English Code-mixed Conversations:
Task and Background
Shared Task Settings
Pilot Study
Participants
Using LLMs
Classical machine learning and deep learning methods
Rule-based and statistical methods
...and 7 more sections

Figures (3)

Figure 1: Examples of emotion-flip reasoning.
Figure 2: Distribution of triggers for the last four utterances from the trigger utterance $i$.
Figure 3: Emotion distribution in E-MaSaC. The colors depict the distribution of emotions capturing positive, negative, mixed, and no feelings (Abbreviations: Ang: Anger, Cnt: Contempt, Dis: Disgust, Fea: Fear, Ntr: Neutral, Sad: Sadness, Sur: Surprise).

SemEval 2024 -- Task 10: Emotion Discovery and Reasoning its Flip in Conversation (EDiReF)

TL;DR

Abstract

SemEval 2024 -- Task 10: Emotion Discovery and Reasoning its Flip in Conversation (EDiReF)

Authors

TL;DR

Abstract

Table of Contents

Figures (3)