NUS-Emo at SemEval-2024 Task 3: Instruction-Tuning LLM for Multimodal Emotion-Cause Analysis in Conversations

Meng Luo; Han Zhang; Shengqiong Wu; Bobo Li; Hong Han; Hao Fei

NUS-Emo at SemEval-2024 Task 3: Instruction-Tuning LLM for Multimodal Emotion-Cause Analysis in Conversations

Meng Luo, Han Zhang, Shengqiong Wu, Bobo Li, Hong Han, Hao Fei

TL;DR

This work tackles Multimodal Emotion-Cause Pair Extraction with Emotion Category (MECPE-Cat) for SemEval-2024 Task 3 by building an LLM-based framework that combines emotion recognition in conversation (ERC) with emotion-cause pair extraction (ECPE). The authors select ChatGLM via a pilot study and enhance it through emotion-cause-aware instruction-tuning using LoRA, alongside multimodal encoding with ImageBind and supplemental video descriptions from GPT-4V. The system decomposes MECPE-Cat into ERC and ECPE stages, iteratively refining training data by reinserting predictions and leveraging emotion labels to boost ECPE accuracy, achieving a weighted F1 of 0.3471 with multimodal inputs. The approach secures 2nd place on the MECPE-Cat leaderboard and is accompanied by code and resources to aid reproducibility and further research.

Abstract

This paper describes the architecture of our system developed for Task 3 of SemEval-2024: Multimodal Emotion-Cause Analysis in Conversations. Our project targets the challenges of subtask 2, dedicated to Multimodal Emotion-Cause Pair Extraction with Emotion Category (MECPE-Cat), and constructs a dual-component system tailored to the unique challenges of this task. We divide the task into two subtasks: emotion recognition in conversation (ERC) and emotion-cause pair extraction (ECPE). To address these subtasks, we capitalize on the abilities of Large Language Models (LLMs), which have consistently demonstrated state-of-the-art performance across various natural language processing tasks and domains. Most importantly, we design an approach of emotion-cause-aware instruction-tuning for LLMs, to enhance the perception of the emotions with their corresponding causal rationales. Our method enables us to adeptly navigate the complexities of MECPE-Cat, achieving a weighted average 34.71% F1 score of the task, and securing the 2nd rank on the leaderboard. The code and metadata to reproduce our experiments are all made publicly available.

NUS-Emo at SemEval-2024 Task 3: Instruction-Tuning LLM for Multimodal Emotion-Cause Analysis in Conversations

TL;DR

Abstract

NUS-Emo at SemEval-2024 Task 3: Instruction-Tuning LLM for Multimodal Emotion-Cause Analysis in Conversations

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (4)