Continuous ErrP detections during multimodal human-robot interaction
Su Kyoung Kim, Michael Maurus, Mathias Trampler, Marc Tabie, Elsa Andrea Kirchner
TL;DR
This work tackles continuous ErrP detection in long-duration, multimodal human-robot interaction where a robot communicates its intentions verbally and via gestures. It introduces a framework using forward and backward sliding-window feature extraction with an online PA1 classifier to enable asynchronous ErrP detection, evaluated in a RH5 Manus lunar-scenario with speech and pointing actions. The study achieves an average balanced accuracy of 91% across 9 subjects, highlighting notable inter-subject variability and the potential for per-subject customization of feature selection. The findings establish the feasibility of continuous ErrP-based intrinsic feedback in interactive reinforcement learning and multimodal HRI, with future work aimed at automatic per-subject feature optimization and applying ErrP signals to online robot-learning loops.
Abstract
Human-in-the-loop approaches are of great importance for robot applications. In the presented study, we implemented a multimodal human-robot interaction (HRI) scenario, in which a simulated robot communicates with its human partner through speech and gestures. The robot announces its intention verbally and selects the appropriate action using pointing gestures. The human partner, in turn, evaluates whether the robot's verbal announcement (intention) matches the action (pointing gesture) chosen by the robot. For cases where the verbal announcement of the robot does not match the corresponding action choice of the robot, we expect error-related potentials (ErrPs) in the human electroencephalogram (EEG). These intrinsic evaluations of robot actions by humans, evident in the EEG, were recorded in real time, continuously segmented online and classified asynchronously. For feature selection, we propose an approach that allows the combinations of forward and backward sliding windows to train a classifier. We achieved an average classification performance of 91% across 9 subjects. As expected, we also observed a relatively high variability between the subjects. In the future, the proposed feature selection approach will be extended to allow for customization of feature selection. To this end, the best combinations of forward and backward sliding windows will be automatically selected to account for inter-subject variability in classification performance. In addition, we plan to use the intrinsic human error evaluation evident in the error case by the ErrP in interactive reinforcement learning to improve multimodal human-robot interaction.
