Improving Forecasts of Suicide Attempts for Patients with Little Data
Genesis Hang, Annie Chen, Hope Neveux, Matthew K. Nock, Yaniv Yacoby
TL;DR
Problem: Forecasting imminent suicide attempts from EMA data is difficult due to the rarity of events and heterogeneity across patients. Approach: The authors propose Latent Similarity Gaussian Processes (LSGPs), embedding patients in a latent space to share trends among similar patients and using a sparse variational GP with inducing points and a product kernel $K_\\theta(\\widehat{X},\\widehat{X}^\\prime) = K_\\theta^x(X,X^\\prime) \\odot K_\\theta^z(Z,Z^\\prime)$. Contributions: (A) empirical demonstration of strong heterogeneity and underperformance of a single model; (B) development of a continuous latent similarity model that enables data-scarce patients to borrow information; (C) preliminary results showing competitive performance without substantial kernel design and a graph-based view of patient similarity that does not align with demographics; (D) analysis showing demographics do not explain similarity via low modularity. Significance: provides improved forecasting for patients with little data and offers a framework for understanding patient subtypes beyond discrete groupings, with potential to guide interventions.
Abstract
Ecological Momentary Assessment provides real-time data on suicidal thoughts and behaviors, but predicting suicide attempts remains challenging due to their rarity and patient heterogeneity. We show that single models fit to all patients perform poorly, while individualized models improve performance but still overfit to patients with limited data. To address this, we introduce Latent Similarity Gaussian Processes (LSGPs) to capture patient heterogeneity, enabling those with little data to leverage similar patients' trends. Preliminary results show promise: even without kernel-design, we outperform all but one baseline while offering a new understanding of patient similarity.
