Depression Detection on Social Media with Large Language Models
Xiaochong Lan, Zhiguang Han, Yiming Cheng, Li Sheng, Jie Feng, Chen Gao, Yong Li
TL;DR
This paper addresses the challenge of detecting depression from social media while requiring medical grounding and model explainability. It proposes DORIS, a hybrid framework that uses LLMs to annotate posts against DSM-5 criteria and to summarize mood trajectories, which are then fused with a Gradient Boosting Trees classifier for accurate predictions. The approach demonstrates a substantial AUPRC gain over strong baselines on the SWDD dataset and generalizes to the Twitter Mental Disorder dataset, with ablations confirming the critical role of diagnostic-criteria features and mood-course representations. By delivering evidence-based explanations grounded in clinical knowledge, DORIS offers a practical, clinician-supportive tool that can enhance large-scale depression screening in low-resource settings.
Abstract
Limited access to mental healthcare resources hinders timely depression diagnosis, leading to detrimental outcomes. Social media platforms present a valuable data source for early detection, yet this task faces two significant challenges: 1) the need for medical knowledge to distinguish clinical depression from transient mood changes, and 2) the dual requirement for high accuracy and model explainability. To address this, we propose DORIS, a framework that leverages Large Language Models (LLMs). To integrate medical knowledge, DORIS utilizes LLMs to annotate user texts against established medical diagnostic criteria and to summarize historical posts into temporal mood courses. These medically-informed features are then used to train an accurate Gradient Boosting Tree (GBT) classifier. Explainability is achieved by generating justifications for predictions based on the LLM-derived symptom annotations and mood course analyses. Extensive experimental results validate the effectiveness as well as interpretability of our method, highlighting its potential as a supportive clinical tool.
