Learning Human-Robot Handshaking Preferences for Quadruped Robots
Alessandra Chappuis, Guillaume Bellegarda, Auke Ijspeert
TL;DR
The study addresses learning user-specific handshaking preferences for quadruped robots to foster social trust in real-world human–robot interactions. It introduces a parameterized handshake model with amplitude $a$, frequency $f$, and stiffness $K_p$, controlled by a Cartesian PD loop, and optimized through active preference-based reward learning (APReL) using 10 pairwise comparisons per user across 25 participants. The reward is $R(\xi)=\omega^{\top} \Phi(\xi)$, updated with Metropolis-Hastings Bayesian inference and a softmax user response model, yielding personalized handshake parameters that reduce amplitude/frequency errors, DTW, and torque, while increasing user satisfaction (19/25 happy, 5 neutral). The work demonstrates rapid personalization of a social gesture on a quadruped robot, with empirical evidence of improved synchronization and energy efficiency, and provides insights into gender differences and passive versus active handshake dynamics. These findings have practical implications for deploying socially capable quadrupeds in public or service settings where trust and natural interaction are crucial.
Abstract
Quadruped robots are showing impressive abilities to navigate the real world. If they are to become more integrated into society, social trust in interactions with humans will become increasingly important. Additionally, robots will need to be adaptable to different humans based on individual preferences. In this work, we study the social interaction task of learning optimal handshakes for quadruped robots based on user preferences. While maintaining balance on three legs, we parameterize handshakes with a Central Pattern Generator consisting of an amplitude, frequency, stiffness, and duration. Through 10 binary choices between handshakes, we learn a belief model to fit individual preferences for 25 different subjects. Our results show that this is an effective strategy, with 76% of users feeling happy with their identified optimal handshake parameters, and 20% feeling neutral. Moreover, compared with random and test handshakes, the optimized handshakes have significantly decreased errors in amplitude and frequency, lower Dynamic Time Warping scores, and improved energy efficiency, all of which indicate robot synchronization to the user's preferences. Video results can be found at https://youtu.be/elvPv8mq1KM .
