Force Generative Imitation Learning: Bridging Position Trajectory and Force Commands through Control Technique

Hiroshi Sato; Sho Sakaino; Toshiaki Tsuji

Force Generative Imitation Learning: Bridging Position Trajectory and Force Commands through Control Technique

Hiroshi Sato, Sho Sakaino, Toshiaki Tsuji

TL;DR

This work tackles the challenge of generating force commands for contact-rich manipulation by introducing force generative imitation learning that converts position trajectories into force commands. It deploys a hierarchical architecture with a memory-based upper layer and a memoryless lower layer, combined with a PID feedback loop to ensure stable tracking. The approach demonstrates improved accuracy and generalization on a character-writing task using a CRANE-X7 manipulator, showing that force-aware control can be achieved without memory-induced instability. Time-scale separation between the upper and lower layers enables robust, memoryless control while leveraging learned trajectory predictions for force generation.

Abstract

In contact-rich tasks, while position trajectories are often easy to obtain, appropriate force commands are typically unknown. Although it is conceivable to generate force commands using a pretrained foundation model such as Vision-Language-Action (VLA) models, force control is highly dependent on the specific hardware of the robot, which makes the application of such models challenging. To bridge this gap, we propose a force generative model that estimates force commands from given position trajectories. However, when dealing with unseen position trajectories, the model struggles to generate accurate force commands. To address this, we introduce a feedback control mechanism. Our experiments reveal that feedback control does not converge when the force generative model has memory. We therefore adopt a model without memory, enabling stable feedback control. This approach allows the system to generate force commands effectively, even for unseen position trajectories, improving generalization for real-world robot writing tasks.

Force Generative Imitation Learning: Bridging Position Trajectory and Force Commands through Control Technique

TL;DR

Abstract

Paper Structure (20 sections, 1 equation, 12 figures, 1 table)

This paper contains 20 sections, 1 equation, 12 figures, 1 table.

Introduction
Related Work
Bilateral Control-Based Imitation Learning
Hierarchical Model
World Model with Control System
Force Generative Imitation Learning
Hierarchical Architecture Separating Memory-Based and Memoryless Neural Networks
PID control
Experiment
Manipulator
Task design and data collection
Training NN
Verification of PID Gain Characteristics
Comparison with conventional methods
Verification of generalizability
...and 5 more sections

Figures (12)

Figure 1: Accuracy of character writing improved by the proposed method
Figure 2: The proposed hierarchical model for correcting output errors
Figure 3: Task environment
Figure 4: LSTM model
Figure 5: MLP model
...and 7 more figures

Force Generative Imitation Learning: Bridging Position Trajectory and Force Commands through Control Technique

TL;DR

Abstract

Force Generative Imitation Learning: Bridging Position Trajectory and Force Commands through Control Technique

Authors

TL;DR

Abstract

Table of Contents

Figures (12)