HOI4ABOT: Human-Object Interaction Anticipation for Human Intention Reading Collaborative roBOTs
Esteve Valls Mascaro, Daniel Sliwowski, Dongheui Lee
TL;DR
HOI anticipation addresses the need for proactive robot assistance in human–robot collaboration. The authors present HOI4ABOT, a transformer-based framework that uses a Patch Merger, dual cross-attention Transformers, and Hydra multi-heads to detect and anticipate HOIs in video. They integrate Dynamic Movement Primitives for motion generation and Behavior Trees for planning, and validate on VidHOI, reporting mAP improvements of $1.76\%$ for detection and $1.04\%$ for anticipation over the state of the art, plus a $15.4\times$ speedup. Real-world experiments with a Franka Emika Panda demonstrate proactive pouring, reducing human waiting time and achieving $85\%$ success across 20 trials. These results highlight the practical potential of intention reading for improving human–robot collaboration and point to domain-specific data and control-method enhancements as future work.
Abstract
Robots are becoming increasingly integrated into our lives, assisting us in various tasks. To ensure effective collaboration between humans and robots, it is essential that robots understand our intentions and anticipate our actions. In this paper, we propose a Human-Object Interaction (HOI) anticipation framework for collaborative robots, built on an efficient and robust transformer-based model that detects and anticipates HOIs from videos. This enhanced anticipation empowers robots to proactively assist humans, resulting in more efficient and intuitive collaborations. Our model outperforms state-of-the-art results in HOI detection and anticipation on the VidHOI dataset, with increases of 1.76% and 1.04% in mAP respectively, while being 15.4 times faster. We showcase the effectiveness of our approach through experiments on a real robot, demonstrating that the robot's ability to anticipate HOIs is key for better Human-Robot Interaction. More information can be found on our project webpage: https://evm7.github.io/HOI4ABOT_page/
