ILBiT: Imitation Learning for Robot Using Position and Torque Information based on Bilateral Control with Transformer

Masato Kobayashi; Thanpimon Buamanee; Yuki Uranishi; Haruo Takemura

ILBiT: Imitation Learning for Robot Using Position and Torque Information based on Bilateral Control with Transformer

Masato Kobayashi, Thanpimon Buamanee, Yuki Uranishi, Haruo Takemura

TL;DR

This work tackles autonomous robotic manipulation by combining imitation learning with bilateral control and Transformer-based sequence modeling. By collecting rich, torque-inclusive demonstrations via a four-channel bilateral setup and training a Transformer encoder, ILBiT predicts leader actions for fast, force-aware execution at 100 Hz. Experiments on a two-robot OpenMANIPULATOR-X setup show ILBiT generalizes better to untrained objects and tasks than LSTM-based baselines, with higher success rates across pick, move, and place actions. The approach offers improved adaptability and speed for real-world manipulation, with potential applicability to broader robotic platforms and dynamic environments.

Abstract

Autonomous manipulation in robot arms is a complex and evolving field of study in robotics. This paper introduces an innovative approach to this challenge by focusing on imitation learning (IL). Unlike traditional imitation methods, our approach uses IL based on bilateral control, allowing for more precise and adaptable robot movements. The conventional IL based on bilateral control method have relied on Long Short-Term Memory (LSTM) networks. In this paper, we present the IL for robot using position and torque information based on Bilateral control with Transformer (ILBiT). This proposed method employs the Transformer model, known for its robust performance in handling diverse datasets and its capability to surpass LSTM's limitations, especially in tasks requiring detailed force adjustments. A standout feature of ILBiT is its high-frequency operation at 100 Hz, which significantly improves the system's adaptability and response to varying environments and objects of different hardness levels. The effectiveness of the Transformer-based ILBiT method can be seen through comprehensive real-world experiments.

ILBiT: Imitation Learning for Robot Using Position and Torque Information based on Bilateral Control with Transformer

TL;DR

Abstract

Paper Structure (18 sections, 6 equations, 17 figures, 3 tables)

This paper contains 18 sections, 6 equations, 17 figures, 3 tables.

Introduction
Robot System
Controller
Bilateral Control
Imitation Learning System Based on Bilateral Control
Imitation Learning based on Bilateral Control with Transformer (ILBiT)
Overall of ILBiT
Data Collection via Bilateral Control
Transformer Model for Learning and Prediction
Execution through Autonomous Control
Experiment
Hardware
Environment Setup
Task Setting
Training Dataset
...and 3 more sections

Figures (17)

Figure 1: Image of Bilateral Control
Figure 2: Block Diagram of Robot System
Figure 3: Block Diagram of Four-channel Bilateral Control
Figure 4: Block Diagram of Imitation Learning based on Bilateral Control
Figure 5: Image Diagram of LSTM Model (Conventional Method)
...and 12 more figures

ILBiT: Imitation Learning for Robot Using Position and Torque Information based on Bilateral Control with Transformer

TL;DR

Abstract

ILBiT: Imitation Learning for Robot Using Position and Torque Information based on Bilateral Control with Transformer

Authors

TL;DR

Abstract

Table of Contents

Figures (17)