Manipulating Elasto-Plastic Objects With 3D Occupancy and Learning-Based Predictive Control
Zhen Zhang, Xiangyu Chu, Yunxi Tang, Lulu Zhao, Jing Huang, Zhongliang Jiang, K. W. Samuel Au
TL;DR
This work addresses the challenge of manipulating elasto-plastic volumetric objects under quasi-static motion by proposing a dense 3D occupancy representation and a learning-based framework that combines a 3D CNN-GNN dynamics model with model predictive control. It introduces a data collection platform and a pipeline to generate dense 3D occupancy ground truth from multi-view RGB-D data, training an occupancy prediction network that operates during manipulation. The proposed approach demonstrates accurate state representation, improved dynamics prediction, and successful shaping of plasticine into target geometries in both simulation and real-world experiments, outperforming voxel-based baselines. By enabling dense internal-state reasoning and occlusion-robust planning, the framework advances practical manipulation of soft, irreversible materials with potential impact on everyday robotics and deformable-object handling tasks.
Abstract
Manipulating elasto-plastic objects remains a significant challenge due to severe self-occlusion, difficulties of representation, and complicated dynamics. This work proposes a novel framework for elasto-plastic object manipulation with a quasi-static assumption for motions, leveraging 3D occupancy to represent such objects, a learned dynamics model trained with 3D occupancy, and a learning-based predictive control algorithm to address these challenges effectively. We build a novel data collection platform to collect full spatial information and propose a pipeline for generating a 3D occupancy dataset. To infer the 3D occupancy during manipulation, an occupancy prediction network is trained with multiple RGB images supervised by the generated dataset. We design a deep neural network empowered by a 3D convolution neural network (CNN) and a graph neural network (GNN) to predict the complex deformation with the inferred 3D occupancy results. A learning-based predictive control algorithm is introduced to plan the robot actions, incorporating a novel shape-based action initialization module specifically designed to improve the planner efficiency. The proposed framework in this paper can successfully shape the elasto-plastic objects into a given goal shape and has been verified in various experiments both in simulation and the real world.
