InterMind: A Doctor-Patient-Family Interactive Depression Assessment System Empowered by Large Language Models
Zhiyuan Zhou, Jilong Liu, Sanwang Wang, Shijie Hao, Yanrong Guo, Richang Hong
TL;DR
This work tackles the inefficiencies and limited interpretability of depression assessment by introducing InterMind, a doctor-patient-family interactive system powered by large language models. It couples an AI Psychological Chatbot for counseling with an AI Psychologist that generates standardized, DSM-V-aligned assistive diagnostic reports using retrieval-augmented generation and chain-of-thought prompts, guided by clinician oversight. Through data augmentation from social-media dialogues and instruction fine-tuning, the approach improves depression binary classification, severity estimation, report quality, and counseling competency, as validated by clinicians and quantitative metrics. The framework enables continuous monitoring and family involvement, aiming to enhance diagnostic precision, efficiency, and caregiving support in real-world clinical workflows.
Abstract
Depression poses significant challenges to patients and healthcare organizations, necessitating efficient assessment methods. Existing paradigms typically focus on a patient-doctor way that overlooks multi-role interactions, such as family involvement in the evaluation and caregiving process. Moreover, current automatic depression detection (ADD) methods usually model depression detection as a classification or regression task, lacking interpretability for the decision-making process. To address these issues, we developed InterMind, a doctor-patient-family interactive depression assessment system empowered by large language models (LLMs). Our system enables patients and families to contribute descriptions, generates assistive diagnostic reports for doctors, and provides actionable insights, improving diagnostic precision and efficiency. To enhance LLMs' performance in psychological counseling and diagnostic interpretability, we integrate retrieval-augmented generation (RAG) and chain-of-thoughts (CoT) techniques for data augmentation, which mitigates the hallucination issue of LLMs in specific scenarios after instruction fine-tuning. Quantitative experiments and professional assessments by clinicians validate the effectiveness of our system.
