Table of Contents
Fetching ...

"Is This Really a Human Peer Supporter?": Misalignments Between Peer Supporters and Experts in LLM-Supported Interactions

Kellie Yu Hui Sim, Roy Ka-Wei Lee, Kenny Tsu Wei Choo

Abstract

Mental health is a growing global concern, prompting interest in AI-driven solutions to expand access to psychosocial support. Peer support, grounded in lived experience, offers a valuable complement to professional care. However, variability in training, effectiveness, and definitions raises concerns about quality, consistency, and safety. Large Language Models (LLMs) present new opportunities to enhance peer support interactions, particularly in real-time, text-based interactions. We present and evaluate an AI-supported system with an LLM-simulated distressed client, context-sensitive LLM-generated suggestions, and real-time emotion visualisations. 2 mixed-methods studies with 12 peer supporters and 5 mental health professionals (i.e., experts) examined the system's effectiveness and implications for practice. Both groups recognised its potential to enhance training and improve interaction quality. However, we found a key tension emerged: while peer supporters engaged meaningfully, experts consistently flagged critical issues in peer supporter responses, such as missed distress cues and premature advice-giving. This misalignment highlights potential limitations in current peer support training, especially in emotionally charged contexts where safety and fidelity to best practices are essential. Our findings underscore the need for standardised, psychologically grounded training, especially as peer support scales globally. They also demonstrate how LLM-supported systems can scaffold this development--if designed with care and guided by expert oversight. This work contributes to emerging conversations on responsible AI integration in mental health and the evolving role of LLMs in augmenting peer-delivered care.

"Is This Really a Human Peer Supporter?": Misalignments Between Peer Supporters and Experts in LLM-Supported Interactions

Abstract

Mental health is a growing global concern, prompting interest in AI-driven solutions to expand access to psychosocial support. Peer support, grounded in lived experience, offers a valuable complement to professional care. However, variability in training, effectiveness, and definitions raises concerns about quality, consistency, and safety. Large Language Models (LLMs) present new opportunities to enhance peer support interactions, particularly in real-time, text-based interactions. We present and evaluate an AI-supported system with an LLM-simulated distressed client, context-sensitive LLM-generated suggestions, and real-time emotion visualisations. 2 mixed-methods studies with 12 peer supporters and 5 mental health professionals (i.e., experts) examined the system's effectiveness and implications for practice. Both groups recognised its potential to enhance training and improve interaction quality. However, we found a key tension emerged: while peer supporters engaged meaningfully, experts consistently flagged critical issues in peer supporter responses, such as missed distress cues and premature advice-giving. This misalignment highlights potential limitations in current peer support training, especially in emotionally charged contexts where safety and fidelity to best practices are essential. Our findings underscore the need for standardised, psychologically grounded training, especially as peer support scales globally. They also demonstrate how LLM-supported systems can scaffold this development--if designed with care and guided by expert oversight. This work contributes to emerging conversations on responsible AI integration in mental health and the evolving role of LLMs in augmenting peer-delivered care.

Paper Structure

This paper contains 88 sections, 12 figures, 16 tables.

Figures (12)

  • Figure 1: Overview of the studies conducted. Study 1 included 12 peer supporters engaging in real-time chats with SimClient, using Suggestions and emotional state visualisations, accompanied by retrospective think-aloud interviews and physiological/behavioural monitoring. Study 2 involved 6 experts (clinical psychologist, counsellors, medical social worker) evaluating Study 1 logs and reflecting on SimClient, Suggestions, and emotional state visualisations.
  • Figure 2: Chat interface used in the study. Participants interact with SimClient by composing text-based responses to client messages. The interface provides real-time emotional feedback, including (A) the client's label and chat duration, (B1) an emoji and label-based summary of emotional states, and (B2) time-series graphs of valence, arousal, and categorical emotions. The main chat window (C) shows the ongoing exchange. Below, Suggestions are displayed in three categories for participants' choosing: motivational interviewing ($S_{MI}$), empathetic responses ($S_{ER}$), and emotional support ($S_{PS}$).
  • Figure 3: Average message lengths (words per message) by participant in Study 1, showing greater variability among peer supporters compared to the relatively stable responses of SimClient.
  • Figure 4: Distribution of peer supporter ratings of SimClient in Study 1 on a 7-point Likert scale. Top: Human-Likeness. Bottom: Realism. Segment width and colour intensity indicate rating frequency (1 = Not at all, 7 = Extremely).
  • Figure 5: Overview of participants' interactions with SimClient, with the usage of Suggestions through three strategies: (1) direct adoption, (2) slight modification, and (3) combining suggestions.
  • ...and 7 more figures