NeuFlow: Real-time, High-accuracy Optical Flow Estimation on Robots Using Edge Devices
Zhiyong Zhang, Huaizu Jiang, Hanumant Singh
TL;DR
NeuFlow tackles real-time, high-accuracy optical flow on edge devices by combining a lightweight multi-scale CNN backbone with global cross-attention at $1/16$ resolution and a local refinement stage at $1/8$, followed by convex upsampling. It achieves comparable accuracy to leading methods like RAFT, GMFlow, and FlowFormer while delivering 10×–70× higher throughput on GPUs and around 30 FPS on Jetson Orin Nano, enabling real-time deployment in SLAM and visual odometry. Trained on FlyingChairs and FlyingThings and evaluated on FlyingThings and Sintel, NeuFlow demonstrates strong cross-domain generalization and practical applicability to robotic workflows. The authors also provide open-source code and weights to facilitate adoption and further research in edge-based optical flow for SWaP-C platforms.
Abstract
Real-time high-accuracy optical flow estimation is a crucial component in various applications, including localization and mapping in robotics, object tracking, and activity recognition in computer vision. While recent learning-based optical flow methods have achieved high accuracy, they often come with heavy computation costs. In this paper, we propose a highly efficient optical flow architecture, called NeuFlow, that addresses both high accuracy and computational cost concerns. The architecture follows a global-to-local scheme. Given the features of the input images extracted at different spatial resolutions, global matching is employed to estimate an initial optical flow on the 1/16 resolution, capturing large displacement, which is then refined on the 1/8 resolution with lightweight CNN layers for better accuracy. We evaluate our approach on Jetson Orin Nano and RTX 2080 to demonstrate efficiency improvements across different computing platforms. We achieve a notable 10x-80x speedup compared to several state-of-the-art methods, while maintaining comparable accuracy. Our approach achieves around 30 FPS on edge computing platforms, which represents a significant breakthrough in deploying complex computer vision tasks such as SLAM on small robots like drones. The full training and evaluation code is available at https://github.com/neufieldrobotics/NeuFlow.
