Object Dynamics Modeling with Hierarchical Point Cloud-based Representations
Chanho Kim, Li Fuxin
TL;DR
This work addresses object-dynamics prediction in 3D by introducing a point-based, geometry-aware neural network that operates on dense point clouds and meshes. It combines Object PointConv for within-object force propagation with Relational PointConv for inter-object interactions inside a hierarchical U-Net, and extends to mesh data by computing interaction points on faces. The approach yields state-of-the-art results on gravity- and collision-centric tasks, outperforming graph neural network baselines on Physion and Kubric datasets, with notable gains in non-rigid drape scenarios and when sampling densely on surfaces. By leveraging continuous point convolutions and an object-centric hierarchy, the method provides accurate, scalable dynamics modeling that can bridge advancements in point-cloud and graph-based physics learning.
Abstract
Modeling object dynamics with a neural network is an important problem with numerous applications. Most recent work has been based on graph neural networks. However, physics happens in 3D space, where geometric information potentially plays an important role in modeling physical phenomena. In this work, we propose a novel U-net architecture based on continuous point convolution which naturally embeds information from 3D coordinates and allows for multi-scale feature representations with established downsampling and upsampling procedures. Bottleneck layers in the downsampled point clouds lead to better long-range interaction modeling. Besides, the flexibility of point convolutions allows our approach to generalize to sparsely sampled points from mesh vertices and dynamically generate features on important interaction points on mesh faces. Experimental results demonstrate that our approach significantly improves the state-of-the-art, especially in scenarios that require accurate gravity or collision reasoning.
