UniFlow: Towards Zero-Shot LiDAR Scene Flow for Autonomous Vehicles via Cross-Domain Generalization

Siyi Li; Qingwen Zhang; Ishan Khatri; Kyle Vedder; Deva Ramanan; Neehar Peri

UniFlow: Towards Zero-Shot LiDAR Scene Flow for Autonomous Vehicles via Cross-Domain Generalization

Siyi Li, Qingwen Zhang, Ishan Khatri, Kyle Vedder, Deva Ramanan, Neehar Peri

TL;DR

This work tackles the generalization of LiDAR-based scene flow across diverse sensors and datasets. It introduces UniFlow, a simple multi-dataset training framework that unifies four AV datasets and retrains state-of-the-art scene-flow models, yielding substantial improvements in both in-domain and zero-shot generalization. The approach achieves new state-of-the-art results on Waymo and nuScenes and strong zero-shot performance on TruckScenes, with analyses highlighting the role of velocity distributions and low-level geometric priors in cross-domain transfer. The findings suggest that learning universal motion priors through dataset unification can greatly enhance robust 3D motion understanding for autonomous vehicles and beyond, while also identifying avenues for future work in non-AV domains and high-speed scenarios.

Abstract

LiDAR scene flow is the task of estimating per-point 3D motion between consecutive point clouds. Recent methods achieve centimeter-level accuracy on popular autonomous vehicle (AV) datasets, but are typically only trained and evaluated on a single sensor. In this paper, we aim to learn general motion priors that transfer to diverse and unseen LiDAR sensors. However, prior work in LiDAR semantic segmentation and 3D object detection demonstrate that naively training on multiple datasets yields worse performance than single dataset models. Interestingly, we find that this conventional wisdom does not hold for motion estimation, and that state-of-the-art scene flow methods greatly benefit from cross-dataset training. We posit that low-level tasks such as motion estimation may be less sensitive to sensor configuration; indeed, our analysis shows that models trained on fast-moving objects (e.g., from highway datasets) perform well on fast-moving objects, even across different datasets. Informed by our analysis, we propose UniFlow, a family of feedforward models that unifies and trains on multiple large-scale LiDAR scene flow datasets with diverse sensor placements and point cloud densities. Our frustratingly simple solution establishes a new state-of-the-art on Waymo and nuScenes, improving over prior work by 5.1% and 35.2% respectively. Moreover, UniFlow achieves state-of-the-art accuracy on unseen datasets like TruckScenes, outperforming prior TruckScenes-specific models by 30.1%.

UniFlow: Towards Zero-Shot LiDAR Scene Flow for Autonomous Vehicles via Cross-Domain Generalization

TL;DR

Abstract

UniFlow: Towards Zero-Shot LiDAR Scene Flow for Autonomous Vehicles via Cross-Domain Generalization

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (6)