GPTCoach: Towards LLM-Based Physical Activity Coaching
Matthew Jörke, Shardul Sapkota, Lyndsea Warkenthien, Niklas Vainio, Paul Schmiedmayer, Emma Brunskill, James A. Landay
TL;DR
GPTCoach investigates how LLMs can address personalization gaps in mobile health by combining an evidence-based onboarding workflow from Active Choices, MI-inspired conversational strategies, and real-time wearable data. The authors develop a technology-probe GPTCoach, implement a multi-prompt chaining architecture, and evaluate a single-session lab study with 16 participants using three months of HealthKit data, finding strong MI adherence, perceived personalization, and user comfort with data sharing. Key contributions include three design principles for LLM-based physical activity coaching, the GPTCoach system design with data-grounded prompt chains, and insights from a qualitative and quantitative evaluation about MI fidelity, data utilization, and user experience. The work highlights practical implications for future mobile health systems, LLM training/evaluation, and privacy/bias/safety considerations, signaling a promising path toward scalable, context-aware physical activity coaching while acknowledging limitations and safety risks.
Abstract
Mobile health applications show promise for scalable physical activity promotion but are often insufficiently personalized. In contrast, health coaching offers highly personalized support but can be prohibitively expensive and inaccessible. This study draws inspiration from health coaching to explore how large language models (LLMs) might address personalization challenges in mobile health. We conduct formative interviews with 12 health professionals and 10 potential coaching recipients to develop design principles for an LLM-based health coach. We then built GPTCoach, a chatbot that implements the onboarding conversation from an evidence-based coaching program, uses conversational strategies from motivational interviewing, and incorporates wearable data to create personalized physical activity plans. In a lab study with 16 participants using three months of historical data, we find promising evidence that GPTCoach gathers rich qualitative information to offer personalized support, with users feeling comfortable sharing concerns. We conclude with implications for future research on LLM-based physical activity support.
