Autonomous Robotic Assembly: From Part Singulation to Precise Assembly
Kei Ota, Devesh K. Jha, Siddarth Jain, Bill Yerazunis, Radu Corcodel, Yash Shukla, Antonia Bronars, Diego Romeres
TL;DR
This work addresses autonomous long-horizon assembly of a gearbox from parts presented in arbitrary configurations. It introduces a multi-modal system that fuses vision, GelSight tactile sensing, and force-torque feedback to perform part singulation, grasping, in-hand pose estimation, and high-precision insertion and meshing in a closed loop. The contributions include a benchmark-like assembly task, a hardware realization with integrated sensing and MuJoCo-based sim-to-real planning, and demonstrated end-to-end success of 225 varied trials with a 99.11% success rate, highlighting robustness and practical potential for flexible manufacturing. The paper also identifies failure modes and suggests future directions toward automatic failure recovery and interactive perception for unfamiliar parts.
Abstract
Imagine a robot that can assemble a functional product from the individual parts presented in any configuration to the robot. Designing such a robotic system is a complex problem which presents several open challenges. To bypass these challenges, the current generation of assembly systems is built with a lot of system integration effort to provide the structure and precision necessary for assembly. These systems are mostly responsible for part singulation, part kitting, and part detection, which is accomplished by intelligent system design. In this paper, we present autonomous assembly of a gear box with minimum requirements on structure. The assembly parts are randomly placed in a two-dimensional work environment for the robot. The proposed system makes use of several different manipulation skills such as sliding for grasping, in-hand manipulation, and insertion to assemble the gear box. All these tasks are run in a closed-loop fashion using vision, tactile, and Force-Torque (F/T) sensors. We perform extensive hardware experiments to show the robustness of the proposed methods as well as the overall system. See supplementary video at https://www.youtube.com/watch?v=cZ9M1DQ23OI.
