An Iterative Associative Memory Model for Empathetic Response Generation
Zhou Yang, Zhaochun Ren, Yufeng Wang, Chao Chen, Haizhou Sun, Xiaofei Zhu, Xiangwen Liao
TL;DR
The paper tackles empathetic response generation by modeling how humans iteratively comprehend emotional and cognitive states across dialogue. It introduces the Iterative Associative Memory Model (IAMM), which uses a second-order interaction attention mechanism to extract and store associated words from explicit dialogue content and implicit reasoning knowledge in a memory module. IAMM encodes both explicit information (utterances and situation) and implicit information (reasoning knowledge from COMET), then iteratively updates associations to improve emotion prediction and response generation. On Empathetic-DIALOGUES, IAMM achieves higher emotion accuracy and response diversity than strong baselines, and its large-language-model variants confirm the benefits of focusing on associated words for empathetic understanding and expression.
Abstract
Empathetic response generation aims to comprehend the cognitive and emotional states in dialogue utterances and generate proper responses. Psychological theories posit that comprehending emotional and cognitive states necessitates iteratively capturing and understanding associated words across dialogue utterances. However, existing approaches regard dialogue utterances as either a long sequence or independent utterances for comprehension, which are prone to overlook the associated words between them. To address this issue, we propose an Iterative Associative Memory Model (IAMM) for empathetic response generation. Specifically, we employ a novel second-order interaction attention mechanism to iteratively capture vital associated words between dialogue utterances and situations, dialogue history, and a memory module (for storing associated words), thereby accurately and nuancedly comprehending the utterances. We conduct experiments on the Empathetic-Dialogue dataset. Both automatic and human evaluations validate the efficacy of the model. Variant experiments on LLMs also demonstrate that attending to associated words improves empathetic comprehension and expression.
