How Does Conversation Length Impact User's Satisfaction? A Case Study of Length-Controlled Conversations with LLM-Powered Chatbots
Shih-Hong Huang, Ya-Fang Lin, Zeyu He, Chieh-Yang Huang, Ting-Hao 'Kenneth' Huang
TL;DR
This study investigates how controlling conversation length affects user satisfaction with LLM-powered chatbots. By building two Slackbots (SlackVanilla and MultiSlack) with adjustable follow-up turns and testing them on 40 questions divided by Conversability, the authors quantify effects via self-reports and MTurk evaluations. Results indicate that longer conversations can boost satisfaction and perceived helpfulness for high-conversability questions, but benefits are not universal and longer turns can introduce repetition or ambiguity. The work highlights the potential and limits of length-adaptive conversational agents and suggests adaptive strategies guided by user context, with future avenues for automatic situational awareness and privacy-aware designs.
Abstract
Users can discuss a wide range of topics with large language models (LLMs), but they do not always prefer solving problems or getting information through lengthy conversations. This raises an intriguing HCI question: How does instructing LLMs to engage in longer or shorter conversations affect conversation quality? In this paper, we developed two Slack chatbots using GPT-4 with the ability to vary conversation lengths and conducted a user study. Participants asked the chatbots both highly and less conversable questions, engaging in dialogues with 0, 3, 5, and 7 conversational turns. We found that the conversation quality does not differ drastically across different conditions, while participants had mixed reactions. Our study demonstrates LLMs' ability to change conversation length and the potential benefits for users resulting from such changes, but we caution that changes in text form may not necessarily imply changes in quality or content.
