3HANDS Dataset: Learning from Humans for Generating Naturalistic Handovers with Supernumerary Robotic Limbs
Artin Saberpour Abadian, Yi-Chi Liao, Ata Otaran, Rishabh Dabral, Marie Muehlhaus, Christian Theobalt, Martin Schmitz, Jürgen Steimle
TL;DR
The paper addresses the challenge of enabling naturalistic handovers in hip-mounted supernumerary robotic limbs (SRLs) by introducing the 3HANDS dataset, which captures 946 asymmetric, intimate-space interactions between a primary user and a second person enacting an SRL across 12 daily activities. It presents three CVAE-based models trained on 3HANDS to (i) generate naturalistic handover trajectories, (ii) predict the region of transfer (ROT) where handovers occur, and (iii) predict when to initiate a handover from implicit cues. A two-stage training scheme and a participant-level train-test split demonstrate the models can generate plausible trajectories (MAE ~$2.1$–$2.7$ cm non-autoregressive) and ROTs (MAE $4.02$–$8.04$ cm; MEAE $0.0002$–$0.004$ rad), with a VR user study showing improved perceived naturalness, comfort, timeliness, and appropriateness compared to a baseline. The dataset and models collectively advance data-driven SRL interactions in intimate spaces, and the authors release 3HANDS to support future research in human-robot handovers and peripersonal-space robotics.
Abstract
Supernumerary robotic limbs (SRLs) are robotic structures integrated closely with the user's body, which augment human physical capabilities and necessitate seamless, naturalistic human-machine interaction. For effective assistance in physical tasks, enabling SRLs to hand over objects to humans is crucial. Yet, designing heuristic-based policies for robots is time-consuming, difficult to generalize across tasks, and results in less human-like motion. When trained with proper datasets, generative models are powerful alternatives for creating naturalistic handover motions. We introduce 3HANDS, a novel dataset of object handover interactions between a participant performing a daily activity and another participant enacting a hip-mounted SRL in a naturalistic manner. 3HANDS captures the unique characteristics of SRL interactions: operating in intimate personal space with asymmetric object origins, implicit motion synchronization, and the user's engagement in a primary task during the handover. To demonstrate the effectiveness of our dataset, we present three models: one that generates naturalistic handover trajectories, another that determines the appropriate handover endpoints, and a third that predicts the moment to initiate a handover. In a user study (N=10), we compare the handover interaction performed with our method compared to a baseline. The findings show that our method was perceived as significantly more natural, less physically demanding, and more comfortable.
