Lightweight Learning from Actuation-Space Demonstrations via Flow Matching for Whole-Body Soft Robotic Grasping

Liudi Yang; Yang Bai; Yuhao Wang; Ibrahim Alsarraj; Gitta Kutyniok; Zhanchi Wang; Ke Wu

Lightweight Learning from Actuation-Space Demonstrations via Flow Matching for Whole-Body Soft Robotic Grasping

Liudi Yang, Yang Bai, Yuhao Wang, Ibrahim Alsarraj, Gitta Kutyniok, Zhanchi Wang, Ke Wu

TL;DR

The paper tackles robust whole-body grasping under uncertainty by leveraging soft robot embodied intelligence and a lightweight actuation-space learning approach. It introduces Rectified Flow to model distributional actuation strategies directly from a small set of demonstrations, avoiding dense sensing and heavy feedback loops. Demonstrated on a tendon-driven SpiRob, the method achieves up to $97.5\%$ grasp success across the workspace from only $30$ demos, generalizes to object size variations of about $\pm33\%$, and remains robust under temporal scaling from $20\%$ to $200\%$ of the reference duration, illustrating strong sim-to-real transfer. Collectively, the work shows that actuation-space learning can convert passive body compliance into functional control intelligence, reducing reliance on centralized controllers for uncertain, contact-rich tasks.

Abstract

Robotic grasping under uncertainty remains a fundamental challenge due to its uncertain and contact-rich nature. Traditional rigid robotic hands, with limited degrees of freedom and compliance, rely on complex model-based and heavy feedback controllers to manage such interactions. Soft robots, by contrast, exhibit embodied mechanical intelligence: their underactuated structures and passive flexibility of their whole body, naturally accommodate uncertain contacts and enable adaptive behaviors. To harness this capability, we propose a lightweight actuation-space learning framework that infers distributional control representations for whole-body soft robotic grasping, directly from deterministic demonstrations using a flow matching model (Rectified Flow),without requiring dense sensing or heavy control loops. Using only 30 demonstrations (less than 8% of the reachable workspace), the learned policy achieves a 97.5% grasp success rate across the whole workspace, generalizes to grasped-object size variations of +-33%, and maintains stable performance when the robot's dynamic response is directly adjusted by scaling the execution time from 20% to 200%. These results demonstrate that actuation-space learning, by leveraging its passive redundant DOFs and flexibility, converts the body's mechanics into functional control intelligence and substantially reduces the burden on central controllers for this uncertain-rich task.

Lightweight Learning from Actuation-Space Demonstrations via Flow Matching for Whole-Body Soft Robotic Grasping

TL;DR

Abstract

Lightweight Learning from Actuation-Space Demonstrations via Flow Matching for Whole-Body Soft Robotic Grasping

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (10)