Multi-Body Neural Scene Flow
Kavisha Vidanapathirana, Shin-Fang Chng, Xueqian Li, Simon Lucey
TL;DR
MBNSF introduces a multi-body rigidity regularizer that enforces approximate isometry within clusters of a source point cloud to induce SE(3) rigidity without explicitly estimating rigid-body transformations. By coupling this isometric-flow regularization with a continuous neural scene-flow prior, MBNSF preserves continuous motion fields and enables accurate long-term 4D trajectory predictions. The approach leverages DBSCAN clustering and a robust spectral objective to identify and preserve rigid-body relationships, improving scene flow and trajectory performance on real-world LiDAR datasets (Argoverse and Waymo) over state-of-the-art NSFP variants. It also provides practical integrations with NSFP and NTP and demonstrates favorable memory and efficiency characteristics compared to per-cluster alternatives, at the cost of increased offline optimization time. The work advances unsupervised, continuous scene flow estimation and long-term motion modeling in dynamic 3D scenes, with broad applicability to autonomous driving and dynamic scene understanding.
Abstract
The test-time optimization of scene flow - using a coordinate network as a neural prior - has gained popularity due to its simplicity, lack of dataset bias, and state-of-the-art performance. We observe, however, that although coordinate networks capture general motions by implicitly regularizing the scene flow predictions to be spatially smooth, the neural prior by itself is unable to identify the underlying multi-body rigid motions present in real-world data. To address this, we show that multi-body rigidity can be achieved without the cumbersome and brittle strategy of constraining the $SE(3)$ parameters of each rigid body as done in previous works. This is achieved by regularizing the scene flow optimization to encourage isometry in flow predictions for rigid bodies. This strategy enables multi-body rigidity in scene flow while maintaining a continuous flow field, hence allowing dense long-term scene flow integration across a sequence of point clouds. We conduct extensive experiments on real-world datasets and demonstrate that our approach outperforms the state-of-the-art in 3D scene flow and long-term point-wise 4D trajectory prediction. The code is available at: https://github.com/kavisha725/MBNSF.
