Leveraging Large Language Models to Power Chatbots for Collecting User Self-Reported Data
Jing Wei, Sungdong Kim, Hyunhoon Jung, Young-Ho Kim
TL;DR
This paper investigates how prompt design for large language models can power chatbots that collect user self-reported health data through natural conversations. It compares four prompt designs (Structured/Descriptive × with/without a personality modifier) applied to four health topics, tested in an online study with $N=48$ participants, yielding a slot-filling rate of $79\%$. The study shows that prompt format, topic, and conversation path significantly influence both data collection performance and conversational style, including empathy-related behaviors. The findings offer practical guidance for building low-cost, LLM-driven chatbots for personal informatics while outlining ethical considerations, limitations, and avenues for future improvements.
Abstract
Large language models (LLMs) provide a new way to build chatbots by accepting natural language prompts. Yet, it is unclear how to design prompts to power chatbots to carry on naturalistic conversations while pursuing a given goal, such as collecting self-report data from users. We explore what design factors of prompts can help steer chatbots to talk naturally and collect data reliably. To this aim, we formulated four prompt designs with different structures and personas. Through an online study (N = 48) where participants conversed with chatbots driven by different designs of prompts, we assessed how prompt designs and conversation topics affected the conversation flows and users' perceptions of chatbots. Our chatbots covered 79% of the desired information slots during conversations, and the designs of prompts and topics significantly influenced the conversation flows and the data collection performance. We discuss the opportunities and challenges of building chatbots with LLMs.
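To make the two quantitative ideas above concrete (the 2 × 2 set of prompt designs and the share of desired information slots that were covered), here is a minimal Python sketch. It is an illustration under stated assumptions, not the authors' implementation: the names `prompt_conditions` and `slot_filling_rate`, the slot names, and the pooled definition of the rate (filled slots divided by all desired slots across dialogues) are hypothetical, and the paper's exact metric may be computed differently.

```python
from itertools import product

# Assumed factor levels for the 2 x 2 prompt design (format x personality modifier).
FORMATS = ("structured", "descriptive")
PERSONALITY = (True, False)


def prompt_conditions():
    """Enumerate the four prompt designs as format/personality pairs."""
    return [
        {"format": fmt, "personality": has_persona}
        for fmt, has_persona in product(FORMATS, PERSONALITY)
    ]


def slot_filling_rate(dialogues):
    """Return the fraction of desired slots that received a value.

    Each dialogue is a dict mapping a slot name to the collected value,
    with None marking a slot the chatbot failed to fill. The rate is
    pooled over all dialogues (an assumption made for this sketch).
    """
    total = sum(len(slots) for slots in dialogues)
    filled = sum(
        1 for slots in dialogues for value in slots.values() if value is not None
    )
    return filled / total if total else 0.0


if __name__ == "__main__":
    # Hypothetical slot names and values, for illustration only.
    example = [
        {"meal": "pasta", "time": None},
        {"meal": "salad", "time": "noon"},
    ]
    print(len(prompt_conditions()))
    print(slot_filling_rate(example))
```

In this toy input, three of the four desired slots hold a value, so the pooled rate is 0.75. Averaging per participant or per topic instead of pooling would be an equally plausible reading of the reported figure.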
