LLM Questionnaire Completion for Automatic Psychiatric Assessment
Gony Rosenman, Lior Wolf, Talma Hendler
TL;DR
The paper tackles the challenge of converting unstructured psychiatric interviews into standardized measures by introducing LMIQ, an impersonation-based questionnaire completion pipeline that converts transcripts into a $135$-dimensional feature vector of LLM-generated responses. This feature vector feeds a Random Forest regressor to predict $PHQ-8$ and $PCL-C$ scores, achieving lower mean-squared errors than multiple baselines on the $E$-DAIC (and related) datasets, with performance enhanced by using the Mixtral 7Bx8 backbone. The work demonstrates that impersonation-based questionnaire completion can bridge narrative interviews and data-driven psychiatric assessment, offering a potentially scalable framework while acknowledging ethical, privacy, and generalizability considerations. Overall, LMIQ provides a novel direction for leveraging LLMs to extract clinically meaningful signals from interview text, potentially augmenting diagnostic workflows with data-driven insights under careful safeguards.
Abstract
We employ a Large Language Model (LLM) to convert unstructured psychological interviews into structured questionnaires spanning various psychiatric and personality domains. The LLM is prompted to answer these questionnaires by impersonating the interviewee. The obtained answers are coded as features, which are used to predict standardized psychiatric measures of depression (PHQ-8) and PTSD (PCL-C), using a Random Forest regressor. Our approach is shown to enhance diagnostic accuracy compared to multiple baselines. It thus establishes a novel framework for interpreting unstructured psychological interviews, bridging the gap between narrative-driven and data-driven approaches for mental health assessment.
