An Empathetic AI Coach for Self-Attachment Therapy
Lisa Alazraki, Ali Ghachem, Neophytos Polydorou, Foaad Khosmood, Abbas Edalat
TL;DR
This work tackles the challenge of delivering scalable, empathetic digital psychotherapy by augmenting a rule-based SAT chatbot with deep learning components for emotion recognition and fluent, empathetic utterance retrieval, plus selectable personas. It introduces the EMPATHETICP ERSONAS dataset (1,181 emotion expressions and 2,143 empathetic rewritings of 45 base utterances) and a multi-objective retrieval framework that optimizes empathy, fluency, and novelty, using a RoBERTa-based emotion classifier with $94.96\%$ accuracy and $95.10\%$ macro F1. A non-clinical trial with $N=16$ participants over five days shows higher perceived empathy, engagement, and usefulness compared with a baseline rule-based system, with clinicians noting improved emotion recognition but limited emotional scope. The work provides practical design guidance, a public dataset, and a reference implementation (SATbot) to advance empathetic digital coaching for SAT, highlighting avenues for safety, scalability, and broader clinical validation.
Abstract
In this work, we present a new dataset and a computational strategy for a digital coach that aims to guide users in practicing the protocols of self-attachment therapy. Our framework augments a rule-based conversational agent with a deep-learning classifier for identifying the underlying emotion in a user's text response, as well as a deep-learning assisted retrieval method for producing novel, fluent and empathetic utterances. We also craft a set of human-like personas that users can choose to interact with. Our goal is to achieve a high level of engagement during virtual therapy sessions. We evaluate the effectiveness of our framework in a non-clinical trial with N=16 participants, all of whom have had at least four interactions with the agent over the course of five days. We find that our platform is consistently rated higher for empathy, user engagement and usefulness than the simple rule-based framework. Finally, we provide guidelines to further improve the design and performance of the application, in accordance with the feedback received.
