PatientHub: A Unified Framework for Patient Simulation
Sahand Sabour, TszYam NG, Minlie Huang
TL;DR
Fragmentation in patient-simulation methods currently impedes reproducibility and fair benchmarking. The paper introduces PatientHub, a unified modular framework with four core components—Clients and Therapists, Evaluators, Generators, and Events—plus a graph-based dialogue orchestrator and rubric-driven LLM-as-a-judge for cross-method evaluation. It provides standardized abstractions, logging, and open-source implementations to enable synthetic data generation, benchmarking, and rapid prototyping of new simulators, demonstrated across multiple CBT-oriented methods. The framework enables controlled comparisons across simulators and therapists, accelerates method development, and supports scalable synthetic data pipelines for patient-centered dialogue research. Overall, PatientHub offers a practical foundation for reproducible benchmarks, datasets, and future method development in mental-health dialogue systems.
Abstract
As Large Language Models increasingly power role-playing applications, simulating patients has become a valuable tool for training counselors and scaling therapeutic assessment. However, prior work is fragmented: existing approaches rely on incompatible, non-standardized data formats, prompts, and evaluation metrics, hindering reproducibility and fair comparison. In this paper, we introduce PatientHub, a unified and modular framework that standardizes the definition, composition, and deployment of simulated patients. To demonstrate PatientHub's utility, we implement several representative patient simulation methods as case studies, showcasing how our framework supports standardized cross-method evaluation and the seamless integration of custom evaluation metrics. We further demonstrate PatientHub's extensibility by prototyping two new simulator variants, highlighting how PatientHub accelerates method development by eliminating infrastructure overhead. By consolidating existing work into a single reproducible pipeline, PatientHub lowers the barrier to developing new simulation methods and facilitates cross-method and cross-model benchmarking. Our framework provides a practical foundation for future datasets, methods, and benchmarks in patient-centered dialogue, and the code is publicly available via https://github.com/Sahandfer/PatientHub.
