Table of Contents
Fetching ...

A Humanoid Visual-Tactile-Action Dataset for Contact-Rich Manipulation

Eunju Kwon, Seungwon Oh, In-Chang Baek, Yucheon Park, Gyungbo Kim, JaeYoung Moon, Yunho Choi, Kyung-Joong Kim

TL;DR

The paper tackles contact-rich manipulation of deformable objects by leveraging a humanoid robot with multi-modal sensing to capture rich interaction signals. It introduces a dense visual-tactile-action dataset collected via teleoperation, totaling 101.9k frames across towel and sponge tasks under strong/weak pressure, with proprioception, egocentric vision, dense tactile maps from Inspire Hands, and tactile heatmaps. A neural fusion model based on dense tactile information demonstrates the utility and reveals optimization challenges associated with high-dimensional tactile inputs. Contributions include the first humanoid visual-tactile-action dataset for soft-object manipulation, a dense-tactile fusion architecture, and comprehensive data analysis guiding future dataset expansion and optimization strategies.

Abstract

Contact-rich manipulation has become increasingly important in robot learning. However, previous studies on robot learning datasets have focused on rigid objects and underrepresented the diversity of pressure conditions for real-world manipulation. To address this gap, we present a humanoid visual-tactile-action dataset designed for manipulating deformable soft objects. The dataset was collected via teleoperation using a humanoid robot equipped with dexterous hands, capturing multi-modal interactions under varying pressure conditions. This work also motivates future research on models with advanced optimization strategies capable of effectively leveraging the complexity and diversity of tactile signals.

A Humanoid Visual-Tactile-Action Dataset for Contact-Rich Manipulation

TL;DR

The paper tackles contact-rich manipulation of deformable objects by leveraging a humanoid robot with multi-modal sensing to capture rich interaction signals. It introduces a dense visual-tactile-action dataset collected via teleoperation, totaling 101.9k frames across towel and sponge tasks under strong/weak pressure, with proprioception, egocentric vision, dense tactile maps from Inspire Hands, and tactile heatmaps. A neural fusion model based on dense tactile information demonstrates the utility and reveals optimization challenges associated with high-dimensional tactile inputs. Contributions include the first humanoid visual-tactile-action dataset for soft-object manipulation, a dense-tactile fusion architecture, and comprehensive data analysis guiding future dataset expansion and optimization strategies.

Abstract

Contact-rich manipulation has become increasingly important in robot learning. However, previous studies on robot learning datasets have focused on rigid objects and underrepresented the diversity of pressure conditions for real-world manipulation. To address this gap, we present a humanoid visual-tactile-action dataset designed for manipulating deformable soft objects. The dataset was collected via teleoperation using a humanoid robot equipped with dexterous hands, capturing multi-modal interactions under varying pressure conditions. This work also motivates future research on models with advanced optimization strategies capable of effectively leveraging the complexity and diversity of tactile signals.

Paper Structure

This paper contains 10 sections, 5 figures, 1 table.

Figures (5)

  • Figure 1: Overview of the teleoperation framework, showing the humanoid robot control setup and the soft and rigid objects used in the experiments.
  • Figure 2: Comparison of tactile signal distributions captured from the dexterous hand when interacting with rigid objects and deformable soft objects, highlighting differences in contact dynamics.
  • Figure 3: t-SNE embedding of tactile signals under dense and sparse sensing configurations, showing the separability of pressure patterns across different contact conditions.
  • Figure 4: Training and test curves of dense and sparse tactile models across all manipulation tasks.
  • Figure 5: Real-world manipulation experiments.