Probabilistic emotion and sentiment modelling of patient-reported experiences
Curtis Murray, Lewis Mitchell, Jonathan Tuke, Mark Mackay
TL;DR
The paper tackles the challenge of extracting meaningful emotional insights from unstructured patient experience narratives by combining metadata network topic modelling with a probabilistic Naive Bayes emotion recommender. It demonstrates that topic-driven features yield superior performance for high-dimensional emotion prediction and binary sentiment classification (notably $F1$ up to 0.921 and strong $nDCG$/$Q$-measure scores) compared with full-text or lexicon baselines. The approach offers interpretability and scalability, providing practical tools (R package persR and Shiny persShiny) to healthcare researchers and practitioners for real-time feedback analysis. The work highlights that patient-caregiver interactions and engagement factors, rather than solely clinical outcomes, predominantly shape emotional experiences, with significant implications for patient-centered care and service improvement.
Abstract
This study introduces a novel methodology for modelling patient emotions from online patient experience narratives. We employed metadata network topic modelling to analyse patient-reported experiences from Care Opinion, revealing key emotional themes linked to patient-caregiver interactions and clinical outcomes. We develop a probabilistic, context-specific emotion recommender system capable of predicting both multilabel emotions and binary sentiments using a naive Bayes classifier using contextually meaningful topics as predictors. The superior performance of our predicted emotions under this model compared to baseline models was assessed using the information retrieval metrics nDCG and Q-measure, and our predicted sentiments achieved an F1 score of 0.921, significantly outperforming standard sentiment lexicons. This method offers a transparent, cost-effective way to understand patient feedback, enhancing traditional collection methods and informing individualised patient care. Our findings are accessible via an R package and interactive dashboard, providing valuable tools for healthcare researchers and practitioners.
