Table of Contents
Fetching ...

FlexiCup: Wireless Multimodal Suction Cup with Dual-Zone Vision-Tactile Sensing

Junhao Gong, Shoujie Li, Kit-Wa Sou, Changqing Guo, Hourong Huang, Tong Wu, Yifan Xie, Chenxin Liang, Chuqiao Lyu, Xiaojun Liang, Wenbo Ding

TL;DR

FlexiCup addresses the sensing gap in traditional suction cups by delivering a fully wireless, dual-mode suction end effector with integrated vision-tactile sensing. The system uses illumination-controlled modality switching within a single optical path and supports both vacuum and Bernoulli mechanisms via modular bottom housings, enabling onboard perception, planning, and control. Key contributions include a modular hardware architecture, dual-zone sensing validated by multimodal recognition and modular grasping, and a diffusion-policy framework with multi-head attention for end-to-end contact-aware manipulation; results show high success across obstacle densities (vacuum 90.0% vs Bernoulli 86.7%) and effective end-to-end tasks (inclined transport 73.3%, orange extraction 66.7%). This work advances autonomous, contact-aware manipulation in unstructured settings and provides open hardware designs to accelerate research in multimodal suction manipulation.

Abstract

Conventional suction cups lack sensing capabilities for contact-aware manipulation in unstructured environments. This paper presents FlexiCup, a fully wireless multimodal suction cup that integrates dual-zone vision-tactile sensing. The central zone dynamically switches between vision and tactile modalities via illumination control for contact detection, while the peripheral zone provides continuous spatial awareness for approach planning. FlexiCup supports both vacuum and Bernoulli suction modes through modular mechanical configurations, achieving complete wireless autonomy with onboard computation and power. We validate hardware versatility through dual control paradigms. Modular perception-driven grasping across structured surfaces with varying obstacle densities demonstrates comparable performance between vacuum (90.0% mean success) and Bernoulli (86.7% mean success) modes. Diffusion-based end-to-end learning achieves 73.3% success on inclined transport and 66.7% on orange extraction tasks. Ablation studies confirm that multi-head attention coordinating dual-zone observations provides 13% improvements for contact-aware manipulation. Hardware designs and firmware are available at https://anonymous.4open.science/api/repo/FlexiCup-DA7D/file/index.html?v=8f531b44.

FlexiCup: Wireless Multimodal Suction Cup with Dual-Zone Vision-Tactile Sensing

TL;DR

FlexiCup addresses the sensing gap in traditional suction cups by delivering a fully wireless, dual-mode suction end effector with integrated vision-tactile sensing. The system uses illumination-controlled modality switching within a single optical path and supports both vacuum and Bernoulli mechanisms via modular bottom housings, enabling onboard perception, planning, and control. Key contributions include a modular hardware architecture, dual-zone sensing validated by multimodal recognition and modular grasping, and a diffusion-policy framework with multi-head attention for end-to-end contact-aware manipulation; results show high success across obstacle densities (vacuum 90.0% vs Bernoulli 86.7%) and effective end-to-end tasks (inclined transport 73.3%, orange extraction 66.7%). This work advances autonomous, contact-aware manipulation in unstructured settings and provides open hardware designs to accelerate research in multimodal suction manipulation.

Abstract

Conventional suction cups lack sensing capabilities for contact-aware manipulation in unstructured environments. This paper presents FlexiCup, a fully wireless multimodal suction cup that integrates dual-zone vision-tactile sensing. The central zone dynamically switches between vision and tactile modalities via illumination control for contact detection, while the peripheral zone provides continuous spatial awareness for approach planning. FlexiCup supports both vacuum and Bernoulli suction modes through modular mechanical configurations, achieving complete wireless autonomy with onboard computation and power. We validate hardware versatility through dual control paradigms. Modular perception-driven grasping across structured surfaces with varying obstacle densities demonstrates comparable performance between vacuum (90.0% mean success) and Bernoulli (86.7% mean success) modes. Diffusion-based end-to-end learning achieves 73.3% success on inclined transport and 66.7% on orange extraction tasks. Ablation studies confirm that multi-head attention coordinating dual-zone observations provides 13% improvements for contact-aware manipulation. Hardware designs and firmware are available at https://anonymous.4open.science/api/repo/FlexiCup-DA7D/file/index.html?v=8f531b44.

Paper Structure

This paper contains 19 sections, 9 figures, 3 tables.

Figures (9)

  • Figure 1: FlexiCup overview showing hardware features (top), modular perception-driven grasping approach (middle), and end-to-end learning framework (bottom). The system integrates active vision-tactile switching, wireless operation, and dual-mode suction capability supporting both vacuum and Bernoulli mechanisms.
  • Figure 2: System architecture showing (a) modular component integration, (b) wireless charging configuration, and (c) standalone charging module.
  • Figure 3: Reconfigurable dual-mode suction mechanisms and operational demonstrations.(a) Vacuum mode generates adhesion through inward airflow, while Bernoulli mode creates lifting force through outward airflow parallel to surfaces. (b) Vacuum mode handling deformable objects and Bernoulli mode handling PCB.
  • Figure 4: Vision-tactile sensing showing (a) dual-zone camera view and (b) modality switching via illumination control.
  • Figure 5: PDMS membrane design and fabrication. (a) PDMS membrane deformation during contact. (b) Dual-layer membrane fabrication process. (c) Four modular configurations (I-IV).
  • ...and 4 more figures