SemEval 2024 -- Task 10: Emotion Discovery and Reasoning its Flip in Conversation (EDiReF)
Shivani Kumar, Md Shad Akhtar, Erik Cambria, Tanmoy Chakraborty
TL;DR
SemEval-2024 Task 10 (EDiReF) delivers a comprehensive study of Emotion Recognition in Conversation (ERC) and Emotion Flip Reasoning (EFR) in English and Hindi-English code-mixed dialogue. It introduces three subtasks across code-mixed and monolingual data, accompanied by two new bilingual corpora (MELD-FR and E-MaSaC/EFR-MaSaC) and explicit trigger-labeling guidelines. Across 84 participants and 24 system-descriptions, top results reached F1 scores of approximately 0.78–0.79 for ERC and EFR tasks, with LLMs leading in code-mixed settings and classical ML methods proving competitive in trigger identification. The work highlights practical implications for dialogue systems, such as translating code-mixed data, leveraging context for emotion flips, and developing interpretable triggers, while providing publicly available datasets to spur future research in multilingual ERC and EFR.
Abstract
We present SemEval-2024 Task 10, a shared task centred on identifying emotions and finding the rationale behind their flips within monolingual English and Hindi-English code-mixed dialogues. This task comprises three distinct subtasks - emotion recognition in conversation for code-mixed dialogues, emotion flip reasoning for code-mixed dialogues, and emotion flip reasoning for English dialogues. Participating systems were tasked to automatically execute one or more of these subtasks. The datasets for these tasks comprise manually annotated conversations focusing on emotions and triggers for emotion shifts (The task data is available at https://github.com/LCS2-IIITD/EDiReF-SemEval2024.git). A total of 84 participants engaged in this task, with the most adept systems attaining F1-scores of 0.70, 0.79, and 0.76 for the respective subtasks. This paper summarises the results and findings from 24 teams alongside their system descriptions.
