Bloom: Designing for LLM-Augmented Behavior Change Interactions
Matthew Jörke, Defne Genç, Valentin Teutschbein, Shardul Sapkota, Sarah Chung, Paul Schmiedmayer, Maria Ines Campero, Abby C. King, Emma Brunskill, James A. Landay
TL;DR
Bloom investigates how an LLM-powered health coaching agent can augment established UI-based behavior-change interactions to promote physical activity. By integrating MI-informed chat with goal setting, planning, tracking, ambient feedback, and data visualizations, Bloom collects qualitative and quantitative data from a four-week field study (N=54). The results show that the LLM condition shifts psychological factors such as perceived benefits, enjoyment, and self-compassion, and yields more personalized plans and higher engagement, though objective PA gains are similar to a non-LLM control in the short term. These findings suggest LLMs may primarily influence readiness and maintenance pathways for longer-term behavior change, with design implications for coaching, agency, and safety in future LLM-augmented health interventions. The work also contributes a safety benchmark and a red-teaming dataset to advance safe deployment of health-oriented LLM coaching systems.
Abstract
Large language models (LLMs) offer novel opportunities to support health behavior change, yet existing work has narrowly focused on text-only interactions. Building on decades of HCI research demonstrating the effectiveness of UI-based interactions, we present Bloom, an application for physical activity promotion that integrates an LLM-based health coaching chatbot with established UI-based interactions. As part of Bloom's development, we conducted a redteaming evaluation and contribute a safety benchmark dataset. In a four-week randomized field study (N=54) comparing Bloom to a non-LLM control, we observed important shifts in psychological outcomes: participants in the LLM condition reported stronger beliefs that activity was beneficial, greater enjoyment, and more self-compassion. Both conditions significantly increased physical activity levels, doubling the proportion of participants meeting recommended weekly guidelines, though we observed no significant differences between conditions. Instead, our findings suggest that LLMs may be more effective at shifting mindsets that precede longer-term behavior change.
