EMPA: Evaluating Persona-Aligned Empathy as a Process

Shiya Zhang; Yuhan Zhan; Ruixi Su; Ruihan Sun; Ziyi Song; Zhaohan Chen; Xiaofan Zhang

EMPA: Evaluating Persona-Aligned Empathy as a Process

Shiya Zhang, Yuhan Zhan, Ruixi Su, Ruihan Sun, Ziyi Song, Zhaohan Chen, Xiaofan Zhang

TL;DR

EMPA is introduced, a process-oriented framework that evaluates persona-aligned support as sustained intervention rather than isolated replies and distills real interactions into controllable, psychologically grounded scenarios, couples them with an open-ended multi-agent sandbox that exposes strategic adaptation and failure modes.

Abstract

Evaluating persona-aligned empathy in LLM-based dialogue agents remains challenging. User states are latent, feedback is sparse and difficult to verify in situ, and seemingly supportive turns can still accumulate into trajectories that drift from persona-specific needs. We introduce EMPA, a process-oriented framework that evaluates persona-aligned support as sustained intervention rather than isolated replies. EMPA distills real interactions into controllable, psychologically grounded scenarios, couples them with an open-ended multi-agent sandbox that exposes strategic adaptation and failure modes, and scores trajectories in a latent psychological space by directional alignment, cumulative impact, and stability. The resulting signals and metrics support reproducible comparison and optimization of long-horizon empathic behavior, and they extend to other agent settings shaped by latent dynamics and weak, hard-to-verify feedback.

EMPA: Evaluating Persona-Aligned Empathy as a Process

TL;DR

Abstract

Paper Structure (47 sections, 19 equations, 16 figures, 6 tables, 2 algorithms)

This paper contains 47 sections, 19 equations, 16 figures, 6 tables, 2 algorithms.

Introduction
Related Work
Method
System Overview
The Data Pipeline: Real-to-Sim Scenario Generation
The Multi-Agent Simulation Environment
Empathy Potential Model (EPM): A Psychodynamic Vector Formalism
Experiments
Experimental Setup
Main Results
Ablation Study
Summary
Results
Dataset
Overall Performance
...and 32 more sections

Figures (16)

Figure 1: Implicit Assumptions in Agent Evaluation and Their Breakdown in Empathy-Oriented Psychological Support
Figure 2: shows a real EMPA sandbox interaction, revealing a failure of scalar empathy evaluation: high magnitude scores without directional alignment lead to ineffective support, encouraging verbose but misaligned responses.
Figure 3: Overview of EMPA. Real affective interaction data are distilled into psychologically consistent user profiles and crisis scenarios. The evaluated model then engages in unscripted, multi-turn interaction with user agents endowed with persona and long-term memory. Empathic behavior is finally quantified from the resulting interaction trajectories using EPM.
Figure 4: These dimensions provide an operational basis for analyzing LLM behavior in human–AI interaction and user experience.
Figure 5: Sensitivity and robustness under controlled perturbations
...and 11 more figures

EMPA: Evaluating Persona-Aligned Empathy as a Process

TL;DR

Abstract

EMPA: Evaluating Persona-Aligned Empathy as a Process

Authors

TL;DR

Abstract

Table of Contents

Figures (16)