Physics-Encoded Graph Neural Networks for Deformation Prediction under Contact
Mahdi Saleh, Michael Sommersperger, Nassir Navab, Federico Tombari
TL;DR
This work tackles the problem of predicting soft-object deformation under contact with rigid bodies in robotics. It introduces Physics-Encoded Graph Neural Networks that embed physical state on triangle-mesh graphs for both soft and rigid bodies and use cross-attention to model their interaction, followed by a decoder to reconstruct post-contact deformations. The method defines losses including $L_{mse}$ and a graph-consistency term, with total loss $L_T = L_{mse} + \lambda_G L_G$, enabling joint learning of geometry and physics. A new Everyday Deform dataset and a retina deformation dataset demonstrate accurate and efficient deformation predictions, and code and data are released to support research in robotic simulation and grasping.
Abstract
In robotics, it's crucial to understand object deformation during tactile interactions. A precise understanding of deformation can elevate robotic simulations and have broad implications across different industries. We introduce a method using Physics-Encoded Graph Neural Networks (GNNs) for such predictions. Similar to robotic grasping and manipulation scenarios, we focus on modeling the dynamics between a rigid mesh contacting a deformable mesh under external forces. Our approach represents both the soft body and the rigid body within graph structures, where nodes hold the physical states of the meshes. We also incorporate cross-attention mechanisms to capture the interplay between the objects. By jointly learning geometry and physics, our model reconstructs consistent and detailed deformations. We've made our code and dataset public to advance research in robotic simulation and grasping.
