LocalTweets to LocalHealth: A Mental Health Surveillance Framework Based on Twitter Data
Vijeta Deshpande, Minhwa Lee, Zonghai Yao, Zihao Zhang, Jason Brian Gibbons, Hong Yu
TL;DR
This work introduces LocalTweets, a neighborhood-level mental health surveillance dataset built from 765 US census block groups (2015–2019) and paired with CDC MH outcomes, and LocalHealth, a language-model–based predictor that ingests locally posted tweets to forecast MH prevalence. The authors systematically compare keyword-filtered versus unfiltered (general) tweets and evaluate multiple encoders, showing general tweets often generalize best, while domain-adapted and larger models (e.g., GPT-3.5) yield strong zero-shot performance. They demonstrate that incorporating ADI improves prediction and that LocalHealth can extrapolate CDC estimates to unreported neighborhoods with competitive accuracy (e.g., F1 around 0.73). The framework lays groundwork for real-time, neighborhood-focused MH surveillance and resource allocation guidance, while emphasizing privacy, ethical use, and reproducibility. Key contributions include the LocalTweets benchmark, the LocalHealth regression/prediction pipeline, and a detailed analysis of data availability and spatial extrapolation in population-level MH forecasting. Finally, the work offers practical implications for public health policy and suggests future expansions to other health domains and equitable resource distribution.
Abstract
Prior research on Twitter (now X) data has provided positive evidence of its utility in developing supplementary health surveillance systems. In this study, we present a new framework to surveil public health, focusing on mental health (MH) outcomes. We hypothesize that locally posted tweets are indicative of local MH outcomes and collect tweets posted from 765 neighborhoods (census block groups) in the USA. We pair these tweets from each neighborhood with the corresponding MH outcome reported by the Center for Disease Control (CDC) to create a benchmark dataset, LocalTweets. With LocalTweets, we present the first population-level evaluation task for Twitter-based MH surveillance systems. We then develop an efficient and effective method, LocalHealth, for predicting MH outcomes based on LocalTweets. When used with GPT3.5, LocalHealth achieves the highest F1-score and accuracy of 0.7429 and 79.78\%, respectively, a 59\% improvement in F1-score over the GPT3.5 in zero-shot setting. We also utilize LocalHealth to extrapolate CDC's estimates to proxy unreported neighborhoods, achieving an F1-score of 0.7291. Our work suggests that Twitter data can be effectively leveraged to simulate neighborhood-level MH outcomes.
