Quality-focused Active Adversarial Policy for Safe Grasping in Human-Robot Interaction
Chenghao Li, Razvan Beuran, Nak Young Chong
TL;DR
Safety in DNN-based vision-guided grasping is challenged when grasp quality maps assign high scores to the human hand or nearby objects in cluttered HRI. The authors present QFAAP, a benign active adversarial policy that combines Adversarial Quality Patch (AQP) and Projected Quality Gradient Descent (PQGD) to actively manipulate the grasp quality map so the hand is deprioritized (set to $Q$-zero) and nearby objects are suppressed, guiding the robot to safer grasps. AQP optimizes a patch via losses $L_q^p$, $L_{tv}$, and $L_d$ to maximize patch efficacy across images, while PQGD enables rapid hand-shape adaptation with a local, constrained update inside the hand region. Evaluated on three grasping datasets and a cobot, QFAAP improves safety metrics (e.g., ND-ACC up to 88% and CH-Rate down to 16%) and remains near real-time, highlighting a practical benign adversarial approach for safer HRI in cluttered environments.
Abstract
Vision-guided robot grasping methods based on Deep Neural Networks (DNNs) have achieved remarkable success in handling unknown objects, attributable to their powerful generalizability. However, these methods with this generalizability tend to recognize the human hand and its adjacent objects as graspable targets, compromising safety during Human-Robot Interaction (HRI). In this work, we propose the Quality-focused Active Adversarial Policy (QFAAP) to solve this problem. Specifically, the first part is the Adversarial Quality Patch (AQP), wherein we design the adversarial quality patch loss and leverage the grasp dataset to optimize a patch with high quality scores. Next, we construct the Projected Quality Gradient Descent (PQGD) and integrate it with the AQP, which contains only the hand region within each real-time frame, endowing the AQP with fast adaptability to the human hand shape. Through AQP and PQGD, the hand can be actively adversarial with the surrounding objects, lowering their quality scores. Therefore, further setting the quality score of the hand to zero will reduce the grasping priority of both the hand and its adjacent objects, enabling the robot to grasp other objects away from the hand without emergency stops. We conduct extensive experiments on the benchmark datasets and a cobot, showing the effectiveness of QFAAP. Our code and demo videos are available here: https://github.com/clee-jaist/QFAAP.
