Toward Scalable, Flexible Scene Flow for Point Clouds

Kyle Vedder

Toward Scalable, Flexible Scene Flow for Point Clouds

Kyle Vedder

TL;DR

This work advances scalable, flexible scene flow for point clouds by combining unsupervised distillation (ZeroFlow), a simple yet effective tracking-based evaluation (Bucket Normalized EPE and TrackFlow), and a novel Eulerian, ODE-based formulation (EulerFlow) that enables long-horizon motion modeling. ZeroFlow demonstrates that large-scale, pseudo-labeled data can outperform costly human labels while running in real time, highlighting data diversity and architecture choices as critical drivers. TrackFlow reveals that standard metrics obscure failures on small objects, prompting a more nuanced evaluation and a simple baseline that achieves strong performance on safety-critical categories. EulerFlow sets a new state-of-the-art in unsupervised scene flow by modeling motion as a continuous-time ODE over an entire sequence, enabling emergent 3D point tracking and broad domain applicability beyond autonomous vehicles. Collectively, these contributions pave the way for robust, scalable, and broadly applicable motion understanding in 3D scenes.

Abstract

Scene flow estimation is the task of describing 3D motion between temporally successive observations. This thesis aims to build the foundation for building scene flow estimators with two important properties: they are scalable, i.e. they improve with access to more data and computation, and they are flexible, i.e. they work out-of-the-box in a variety of domains and on a variety of motion patterns without requiring significant hyperparameter tuning. In this dissertation we present several concrete contributions towards this. In Chapter 1 we contextualize scene flow and its prior methods. In Chapter 2 we present a blueprint to build and scale feedforward scene flow estimators without requiring expensive human annotations via large scale distillation from pseudolabels provided by strong unsupervised test-time optimization methods. In Chapter 3 we introduce a benchmark to better measure estimate quality across diverse object types, better bringing into focus what we care about and expect from scene flow estimators, and use this benchmark to host a public challenge that produced significant progress. In Chapter 4 we present a state-of-the-art unsupervised scene flow estimator that introduces a new, full sequence problem formulation and exhibits great promise in adjacent domains like 3D point tracking. Finally, in Chapter 5 I philosophize about what's next for scene flow and its potential future broader impacts.

Toward Scalable, Flexible Scene Flow for Point Clouds

TL;DR

Abstract

Toward Scalable, Flexible Scene Flow for Point Clouds

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (51)