A Service Robot's Guide to Interacting with Busy Customers
Suraj Nukala, Meera Sushma, Leimin Tian, Akansel Cosgun, Dana Kulic
TL;DR
This study investigates how service robots should communicate with cognitively busy customers in hospitality settings. Using a two-part, within-subjects design with a Temi robot, it compares baseline, non-verbal acoustic cues, and a multimodal cue set (visualization, micromotion, and speech) to determine effects on attention capture, order accuracy, and user experience. Part 1 finds non-verbal acoustic cues improve attention but without significant subjective gains; Part 2 shows that visualisation best communicates intent, speech aids delivery, and multimodal cues yield the strongest overall performance, especially in accuracy. The results support deploying multimodal communication strategies to enhance efficiency and user satisfaction in dynamic, real-world service environments, while acknowledging limitations of laboratory settings and the need for real-world validation.
Abstract
The growing use of service robots in hospitality highlights the need to understand how to effectively communicate with pre-occupied customers. This study investigates the efficacy of commonly used communication modalities by service robots, namely, acoustic/speech, visual display, and micromotion gestures in capturing attention and communicating intention with a user in a simulated restaurant scenario. We conducted a two-part user study (N=24) using a Temi robot to simulate delivery tasks, with participants engaged in a typing game (MonkeyType) to emulate a state of busyness. The participants' engagement in the typing game is measured by words per minute (WPM) and typing accuracy. In Part 1, we compared non-verbal acoustic cue versus baseline conditions to assess attention capture during a single-cup delivery task. In Part 2, we evaluated the effectiveness of speech, visual display, micromotion and their multimodal combination in conveying specific intentions (correct cup selection) during a two-cup delivery task. The results indicate that, while speech is highly effective in capturing attention, it is less successful in clearly communicating intention. Participants rated visual as the most effective modality for intention clarity, followed by speech, with micromotion being the lowest ranked.These findings provide insights into optimizing communication strategies for service robots, highlighting the distinct roles of attention capture and intention communication in enhancing user experience in dynamic hospitality settings.
