Eeyore: Realistic Depression Simulation via Supervised and Preference Optimization
Siyang Liu, Bianca Brie, Wenda Li, Laura Biester, Andrew Lee, James Pennebaker, Rada Mihalcea
TL;DR
The paper tackles the challenge of authentically simulating depressive clients for clinical training, identifying biases in general-purpose LLMs that hinder realism. It presents Eeyore, an 8B model optimized via a three-stage alignment pipeline—Language-specific Alignment, Profile-Guided Role-Play, and Iterative Preference Optimization—to produce linguistically authentic, profile-consistent depression dialogues. By constructing real-world depression dialogue datasets, encoding structured psychological profiles, and applying a two-stage Direct Preference Optimization (with model-generated and expert preferences), Eeyore outperforms GPT-4o-based baselines in both linguistic authenticity and profile adherence. The work demonstrates the value of expert-in-the-loop alignment for clinically meaningful depression simulations and offers a pathway for deploying realistic role-play tools in mental health training.
Abstract
Large Language Models (LLMs) have been previously explored for mental healthcare training and therapy client simulation, but they still fall short in authentically capturing diverse client traits and psychological conditions. We introduce \textbf{Eeyore}, an 8B model optimized for realistic depression simulation through a structured alignment framework, incorporating expert input at every stage. First, we systematically curate real-world depression-related conversations, extracting depressive traits to guide data filtering and psychological profile construction, and use this dataset to instruction-tune Eeyore for profile adherence. Next, to further enhance realism, Eeyore undergoes iterative preference optimization -- first leveraging model-generated preferences and then calibrating with a small set of expert-annotated preferences. Throughout the entire pipeline, we actively collaborate with domain experts, developing interactive interfaces to validate trait extraction and iteratively refine structured psychological profiles for clinically meaningful role-play customization. Despite its smaller model size, the Eeyore depression simulation outperforms GPT-4o with SOTA prompting strategies, both in linguistic authenticity and profile adherence.
