RACER: An LLM-powered Methodology for Scalable Analysis of Semi-structured Mental Health Interviews
Satpreet Harcharan Singh, Kevin Jiang, Kanchan Bhasin, Ashutosh Sabharwal, Nidal Moukaddam, Ankit B Patel
TL;DR
This work presents RACER, an expert-guided LLM-based pipeline that converts semi-structured interview transcripts into thematically clustered insights at scale. By retrieving, aggregating, clustering with expert guidance, and reclustering responses, RACER achieves substantial agreement with human evaluators and provides a scalable approach to analyzing COVID-19-related mental health impacts among 93 healthcare professionals. The study identifies both the potential and limitations of LLM-assisted qualitative analysis, notably that nuanced emotions pose challenges for both humans and machines, and proposes confidence measures to flag ambiguous content. Overall, RACER demonstrates a practical pathway to accelerate qualitative healthcare research while highlighting the enduring need for human expertise in interpreting complex emotional narratives.
Abstract
Semi-structured interviews (SSIs) are a commonly employed data-collection method in healthcare research, offering in-depth qualitative insights into subject experiences. Despite their value, the manual analysis of SSIs is notoriously time-consuming and labor-intensive, in part due to the difficulty of extracting and categorizing emotional responses, and challenges in scaling human evaluation for large populations. In this study, we develop RACER, a Large Language Model (LLM) based expert-guided automated pipeline that efficiently converts raw interview transcripts into insightful domain-relevant themes and sub-themes. We used RACER to analyze SSIs conducted with 93 healthcare professionals and trainees to assess the broad personal and professional mental health impacts of the COVID-19 crisis. RACER achieves moderately high agreement with two human evaluators (72%), which approaches the human inter-rater agreement (77%). Interestingly, LLMs and humans struggle with similar content involving nuanced emotional, ambivalent/dialectical, and psychological statements. Our study highlights the opportunities and challenges in using LLMs to improve research efficiency and opens new avenues for scalable analysis of SSIs in healthcare research.
