Collaborative Assembly Policy Learning of a Sightless Robot

Zeqing Zhang; Weifeng Lu; Lei Yang; Wei Jing; Bowei Tang; Jia Pan

Collaborative Assembly Policy Learning of a Sightless Robot

Zeqing Zhang, Weifeng Lu, Lei Yang, Wei Jing, Bowei Tang, Jia Pan

TL;DR

This work tackles a sightless-robot board-insertion task requiring precise co-manipulation with a human operator under sparse rewards. It introduces Policy-Guided PPO (PGPPO), which fuses a human-designed admittance-control policy with reinforcement learning and leverages human demonstrations to bootstrap learning, aided by a simplified human dynamics model and domain randomization. In both simulation and real-world tests, PGPPO outperformed pure admittance control and standard PPO, achieving higher success rates, shorter completion times, and significantly lower interaction forces during insertion. The results demonstrate that admittance-control guidance plus demonstrations can dramatically improve safety and efficiency in physical human-robot collaboration, with potential to generalize to other millimeter-tolerance co-manipulation tasks.

Abstract

This paper explores a physical human-robot collaboration (pHRC) task involving the joint insertion of a board into a frame by a sightless robot and a human operator. While admittance control is commonly used in pHRC tasks, it can be challenging to measure the force/torque applied by the human for accurate human intent estimation, limiting the robot's ability to assist in the collaborative task. Other methods that attempt to solve pHRC tasks using reinforcement learning (RL) are also unsuitable for the board-insertion task due to its safety constraints and sparse rewards. Therefore, we propose a novel RL approach that utilizes a human-designed admittance controller to facilitate more active robot behavior and reduce human effort. Through simulation and real-world experiments, we demonstrate that our approach outperforms admittance control in terms of success rate and task completion time. Additionally, we observed a significant reduction in measured force/torque when using our proposed approach compared to admittance control. The video of the experiments is available at https://youtu.be/va07Gw6YIog.

Collaborative Assembly Policy Learning of a Sightless Robot

TL;DR

Abstract

Collaborative Assembly Policy Learning of a Sightless Robot

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (6)