Bidirectional Human Interactive AI Framework for Social Robot Navigation
Tuba Girgin, Emre Girgin, Yigit Yildirim, Emre Ugur, Mehmet Haklidir
TL;DR
The paper addresses the need for trustworthy, explainable bidirectional human-robot interaction in social navigation for autonomous mobile robots. It proposes an end-to-end framework that integrates RGB-LIDAR perception, a trajectory forecasting backbone based on LSTM encodings to a Graph Attention Network, and a trustworthy AI module that verbalizes decisions while incorporating human input through hand gestures. Key contributions include a novel GNN-based trajectory predictor, a bidirectional audio-visual interaction workflow, and a demonstration in an office environment with a TurtleBot 3, highlighting social-aware planning and conflict resolution. The framework aims to improve safety, transparency, and user comfort in human-centric industrial settings by providing explanations and accepting human guidance online. Future work includes full system integration, dataset collection in smart factories, and user surveys to assess the effectiveness of vocal explanations and gesture-based directives.
Abstract
Trustworthiness is a crucial concept in the context of human-robot interaction. Cooperative robots must be transparent regarding their decision-making process, especially when operating in a human-oriented environment. This paper presents a comprehensive end-to-end framework aimed at fostering trustworthy bidirectional human-robot interaction in collaborative environments for the social navigation of mobile robots. In this framework, the robot communicates verbally while the human guides with gestures. Our method enables a mobile robot to predict the trajectory of people and adjust its route in a socially-aware manner. In case of conflict between human and robot decisions, detected through visual examination, the route is dynamically modified based on human preference while verbal communication is maintained. We present our pipeline, framework design, and preliminary experiments that form the foundation of our proposition.
