Table of Contents
Fetching ...

The Role of Tactile Sensing for Learning Reach and Grasp

Boya Zhang, Iris Andrussow, Andreas Zell, Georg Martius

TL;DR

This work investigates how different tactile sensing modalities affect reinforcement-learning–driven reach and 2‑finger antipodal grasping, using a parallelized simulation pipeline and real‑robot tests with a Minsight‑like tactile setup. By comparing sensing area and data types (e.g., binary, magnitude, and full force vectors) under perfect and degraded vision, the study finds that tactile information is most beneficial when vision is imperfect, but high spatial resolution is not strictly necessary; a sum‑force vector input often yields the strongest gains and supports generalization. Across sim and sim‑to‑real tests, tactile cues improve robustness and transfer, though a notable sim‑to‑real gap remains, partly due to contact dynamics and sensor fidelity. The results offer practical guidance for designing tactile sensing in 2‑finger RL grasping systems and motivate future work on blind grasping and multi‑finger manipulation in more diverse environments.

Abstract

Stable and robust robotic grasping is essential for current and future robot applications. In recent works, the use of large datasets and supervised learning has enhanced speed and precision in antipodal grasping. However, these methods struggle with perception and calibration errors due to large planning horizons. To obtain more robust and reactive grasping motions, leveraging reinforcement learning combined with tactile sensing is a promising direction. Yet, there is no systematic evaluation of how the complexity of force-based tactile sensing affects the learning behavior for grasping tasks. This paper compares various tactile and environmental setups using two model-free reinforcement learning approaches for antipodal grasping. Our findings suggest that under imperfect visual perception, various tactile features improve learning outcomes, while complex tactile inputs complicate training.

The Role of Tactile Sensing for Learning Reach and Grasp

TL;DR

This work investigates how different tactile sensing modalities affect reinforcement-learning–driven reach and 2‑finger antipodal grasping, using a parallelized simulation pipeline and real‑robot tests with a Minsight‑like tactile setup. By comparing sensing area and data types (e.g., binary, magnitude, and full force vectors) under perfect and degraded vision, the study finds that tactile information is most beneficial when vision is imperfect, but high spatial resolution is not strictly necessary; a sum‑force vector input often yields the strongest gains and supports generalization. Across sim and sim‑to‑real tests, tactile cues improve robustness and transfer, though a notable sim‑to‑real gap remains, partly due to contact dynamics and sensor fidelity. The results offer practical guidance for designing tactile sensing in 2‑finger RL grasping systems and motivate future work on blind grasping and multi‑finger manipulation in more diverse environments.

Abstract

Stable and robust robotic grasping is essential for current and future robot applications. In recent works, the use of large datasets and supervised learning has enhanced speed and precision in antipodal grasping. However, these methods struggle with perception and calibration errors due to large planning horizons. To obtain more robust and reactive grasping motions, leveraging reinforcement learning combined with tactile sensing is a promising direction. Yet, there is no systematic evaluation of how the complexity of force-based tactile sensing affects the learning behavior for grasping tasks. This paper compares various tactile and environmental setups using two model-free reinforcement learning approaches for antipodal grasping. Our findings suggest that under imperfect visual perception, various tactile features improve learning outcomes, while complex tactile inputs complicate training.

Paper Structure

This paper contains 16 sections, 11 figures, 4 tables.

Figures (11)

  • Figure 1: Simulation setup: (a) environment for tabletop grasping with panda robot and antipodal gripper. (b) Top row: three main objects; Bottom row: samples from 100 objects. (c) Approximation of the tactile sensor next to the real Minsight sensor Andrussow23-AIS-Minsight.
  • Figure 2: Tactile sensor classification according to the sensing area (side/coverage and units/resolution) and sensing type (color). Sensors created or used by: Romero et al.romero2020soft, TacTip ward2018tactip, ReSkin bhirangi2021reskin, da Fonseca et al. da2022tactile, Ding et al. ding2021sim, Pitz et al. pitz2023dextrous, Koiva et al. koiva2013highly, uSkin funabashi2020stable, BioTac Lin2013EstimatingPO, GelSight yuan2017gelsight, GelSlim taylor2022gelslim, GelTip gomes2020geltip, GelSight360 tippur2023gelsight360, Minsight Andrussow23-AIS-Minsight.
  • Figure 3: Approximation of tactile sensors with different sensing areas, left (view from the inner side), right (view from the back side). A: the whole area of the sensor gives force feedback (used for detailed force map or sum of forces). K=5: force feedback only for sites on inner center line. K=9: extended center line to include also the back side. K=12: provides dense feedback for the area most touched on the inner side of the sensor
  • Figure 4: Training with ideal visual and tactile sensing for SAC and MPO.
  • Figure 5: Comparison between 2 memory lengths using SAC:left:1, right:5.
  • ...and 6 more figures