LLM-as-a-Supervisor: Mistaken Therapeutic Behaviors Trigger Targeted Supervisory Feedback

Chen Xu; Zhenyu Lv; Tian Lan; Xianyang Wang; Luyao Ji; Leyang Cui; Minqiang Yang; Jian Shen; Qunxi Dong; Xiuling Liu; Juan Wang; Bin Hu

TL;DR

The paper addresses scalable therapist training by introducing LLM-as-a-Supervisor, a framework that uses universal mistaken behaviors as triggers for clear, actionable feedback. It presents MATE, a mistake-driven, multi-agent data synthesis pipeline that uses Validator-Guided Refinement to ensure high-quality supervision data. Fine-tuning open-source models on MATE yields significant gains in mistake localization, mistake category classification, and supervisory feedback quality, and the improvements transfer to empathy classification and to novice therapists' self-efficacy. The results suggest that lightweight models can reach professional-level supervisory capability, offering a scalable path for AI-assisted psychotherapy training.

Abstract

Although large language models (LLMs) hold significant promise in psychotherapy, applying them directly in patient-facing scenarios raises ethical and safety concerns. This work therefore shifts towards developing an LLM as a supervisor that trains real therapists. Beyond the privacy constraints on clinical therapist training data, a fundamental contradiction complicates the training of therapeutic behaviors: clear feedback standards are necessary for a controlled training system, yet in practice there is no absolute "gold standard" for appropriate therapeutic behavior. In contrast, many common therapeutic mistakes are universal and identifiable, which makes them effective triggers for targeted feedback and clearer evidence to train on. Motivated by this, we create a novel therapist-training paradigm: (1) guidelines for mistaken behaviors and targeted correction strategies are first established as standards; (2) a human-in-the-loop dialogue-feedback dataset is then constructed, in which a mistake-prone agent deliberately makes standard mistakes in a natural way during interviews, and a supervisor agent locates and identifies the mistakes and provides targeted feedback; (3) after fine-tuning on this dataset, the final supervisor model is provided for real therapist training. Automated, human, and downstream evaluations show that models fine-tuned on our dataset, MATE, can provide high-quality feedback that follows the clinical guidelines, indicating significant potential for therapist training.
