ESCoT: Towards Interpretable Emotional Support Dialogue Systems
Tenggan Zhang, Xinjie Zhang, Jinming Zhao, Li Zhou, Qin Jin
TL;DR
This work tackles the interpretability gap in emotional support dialogue systems by introducing ESCoT, a scheme that mimics emotion identification, understanding via stimulus and appraisal, and regulation through strategy. It constructs ESD-CoT through a two-stage process: (i) ESD with situation generation and strategy enrichment, and (ii) ESD-CoT with CoT annotations expressed as the quintuple $(EM, ES, IA, SR, RE)$, followed by manual corrections. The authors build 1,708 ESD-CoT dialogues and demonstrate that fine-tuning a backbone model on this data yields responses with improved coherence, informativeness, empathy, and strategy consistency, validated by both automatic metrics and human evaluations. The dataset and code are released to promote future research into interpretable emotional support dialogue systems and CoT-based reasoning in this domain.
Abstract
Understanding the reason for emotional support response is crucial for establishing connections between users and emotional support dialogue systems. Previous works mostly focus on generating better responses but ignore interpretability, which is extremely important for constructing reliable dialogue systems. To empower the system with better interpretability, we propose an emotional support response generation scheme, named $\textbf{E}$motion-Focused and $\textbf{S}$trategy-Driven $\textbf{C}$hain-$\textbf{o}$f-$\textbf{T}$hought ($\textbf{ESCoT}$), mimicking the process of $\textit{identifying}$, $\textit{understanding}$, and $\textit{regulating}$ emotions. Specially, we construct a new dataset with ESCoT in two steps: (1) $\textit{Dialogue Generation}$ where we first generate diverse conversation situations, then enhance dialogue generation using richer emotional support strategies based on these situations; (2) $\textit{Chain Supplement}$ where we focus on supplementing selected dialogues with elements such as emotion, stimuli, appraisal, and strategy reason, forming the manually verified chains. Additionally, we further develop a model to generate dialogue responses with better interpretability. We also conduct extensive experiments and human evaluations to validate the effectiveness of the proposed ESCoT and generated dialogue responses. Our data and code are available at $\href{https://github.com/TeigenZhang/ESCoT}{https://github.com/TeigenZhang/ESCoT}$.
