Alternative Interfaces for Human-initiated Natural Language Communication and Robot-initiated Haptic Feedback: Towards Better Situational Awareness in Human-Robot Collaboration
Callum Bennie, Bridget Casey, Cecile Paris, Dana Kulic, Brendan Tidd, Nicholas Lawrance, Alex Pitt, Fletcher Talbot, Jason Williams, David Howard, Pavan Sikka, Hashini Senaratne
TL;DR
The paper addresses the gap in situational awareness for human-robot collaboration by introducing two alternative interfaces: a natural-language speech interface and a forearm-based haptic feedback wearable for Spot robots. It details a pipeline that combines Whisper for speech-to-text, a GPT-3-based interpreter with retrieval-augmented prompting, and Silero for text-to-speech, producing JSON responses to control robots and query status, with a design emphasis on offline viability and multi-robot scalability. It also presents a ten-motor, forearm vibrotactile array controlled via ROS to convey robot status through customizable patterns, including a comprehensive GUI for configuring event-driven patterns and priorities. Preliminary user evaluations indicate that both interfaces support intuitive interaction and improved awareness, while highlighting latency, pattern discrimination, and portability as areas for improvement. Collectively, the work demonstrates the potential of multimodal, human-centered interfaces to enhance responsiveness and reduce cognitive load in real-time human-robot teams, with implications for disaster response and collaborative automation.
Abstract
This article presents an implementation of a natural-language speech interface and a haptic feedback interface that enables a human supervisor to provide guidance to, request information, and receive status updates from a Spot robot. We provide insights gained during preliminary user testing of the interface in a realistic robot exploration scenario.
