EgoFlowNet: Non-Rigid Scene Flow from Point Clouds with Ego-Motion Support
Ramy Battrawy, René Schuster, Didier Stricker
TL;DR
EgoFlowNet tackles non-rigid scene flow estimation from LiDAR point clouds under weak supervision by avoiding clustering and object-level rigidity assumptions. It jointly predicts a point-level foreground/background segmentation mask $M_{fg}$ and flows for both ego-motion and scene flow across a four-scale, coarse-to-fine pipeline with a shared cost volume and hybrid features, enabling robust non-rigid motion estimation. The ego-motion branch uses correspondences and the Kabsch algorithm to estimate $(\hat{R},\hat{t})$, while the scene-flow branch performs multi-stage refinement with dual attention to produce $\hat{S}_k$, and BG points have their flow merged via $M^P_{bg}$ using the predicted ego-motion. The method achieves state-of-the-art performance on KITTI datasets in the presence of ground points, offering strong accuracy, efficiency (~$140\mathrm{ms}$ per frame on a Titan V), and robustness to occlusions, marking a substantive advance in clustering-free, point-level scene flow for autonomous driving.
Abstract
Recent weakly-supervised methods for scene flow estimation from LiDAR point clouds are limited to explicit reasoning on object-level. These methods perform multiple iterative optimizations for each rigid object, which makes them vulnerable to clustering robustness. In this paper, we propose our EgoFlowNet - a point-level scene flow estimation network trained in a weakly-supervised manner and without object-based abstraction. Our approach predicts a binary segmentation mask that implicitly drives two parallel branches for ego-motion and scene flow. Unlike previous methods, we provide both branches with all input points and carefully integrate the binary mask into the feature extraction and losses. We also use a shared cost volume with local refinement that is updated at multiple scales without explicit clustering or rigidity assumptions. On realistic KITTI scenes, we show that our EgoFlowNet performs better than state-of-the-art methods in the presence of ground surface points.
