MemEmo: Evaluating Emotion in Memory Systems of Agents

Peng Liu; Zhen Tao; Jihao Zhao; Ding Chen; Yansong Zhang; Cuiping Li; Zhiyu Li; Hong Chen

MemEmo: Evaluating Emotion in Memory Systems of Agents

Peng Liu, Zhen Tao, Jihao Zhao, Ding Chen, Yansong Zhang, Cuiping Li, Zhiyu Li, Hong Chen

TL;DR

An emotion-enhanced memory evaluation benchmark is proposed to assess the performance of mainstream and state-of-the-art memory systems in handling affective information and suggest a new trajectory for future research and system optimization.

Abstract

Memory systems address the challenge of context loss in Large Language Model during prolonged interactions. However, compared to human cognition, the efficacy of these systems in processing emotion-related information remains inconclusive. To address this gap, we propose an emotion-enhanced memory evaluation benchmark to assess the performance of mainstream and state-of-the-art memory systems in handling affective information. We developed the \textbf{H}uman-\textbf{L}ike \textbf{M}emory \textbf{E}motion (\textbf{HLME}) dataset, which evaluates memory systems across three dimensions: emotional information extraction, emotional memory updating, and emotional memory question answering. Experimental results indicate that none of the evaluated systems achieve robust performance across all three tasks. Our findings provide an objective perspective on the current deficiencies of memory systems in processing emotional memories and suggest a new trajectory for future research and system optimization.

MemEmo: Evaluating Emotion in Memory Systems of Agents

TL;DR

Abstract

Paper Structure (18 sections, 14 equations, 3 figures, 4 tables)

This paper contains 18 sections, 14 equations, 3 figures, 4 tables.

Introduction
Related Work
Memory Systems
Emotion Analysis & Evaluation in LLM
Problem Definition
Methodology for Construct HLME
Evaluation Framework of HLME
Emotion Information Extract EIE
Emotion Memory Update EMU
Emotion Question & Answer EQA
Overall Weighted Score
Experiments
Experiments Setup
The Experimental Results Analysis
Overall Evaluation of HLME
...and 3 more sections

Figures (3)

Figure 1: An example illustrates the lack of emotion processing by the memory system in HCI dialogues.
Figure 2: The HLME dataset construction process pipeline.
Figure 3: HLME dataset evaluation scheme.

MemEmo: Evaluating Emotion in Memory Systems of Agents

TL;DR

Abstract

MemEmo: Evaluating Emotion in Memory Systems of Agents

Authors

TL;DR

Abstract

Table of Contents

Figures (3)