DISORF: A Distributed Online 3D Reconstruction Framework for Mobile Robots
Chunlin Li, Hanrui Fan, Xiaorui Huang, Ruofan Liang, Sankeerth Durvasula, Nandita Vijaykumar
TL;DR
DISORF presents a distributed framework for online 3D reconstruction on resource-constrained mobile robots by splitting computation between edge SLAM and a powerful remote server. It supports both NeRF and 3D Gaussian Splatting with online training, and addresses a key challenge—imbalanced frame sampling during streaming—by introducing a shifted exponential sampling strategy that emphasizes recent keyframes while preserving older ones. The framework leverages on-device pose estimation, keyframe streaming, and a Nerfstudio-based remote pipeline to deliver real-time visualization and high-quality 3D representations of unknown scenes. Empirical results on Replica and Tanks & Temples demonstrate improved rendering quality and practical viability, highlighting the approach’s potential for real-time robotics tasks in varying network conditions.
Abstract
We present a framework, DISORF, to enable online 3D reconstruction and visualization of scenes captured by resource-constrained mobile robots and edge devices. To address the limited computing capabilities of edge devices and potentially limited network availability, we design a framework that efficiently distributes computation between the edge device and the remote server. We leverage on-device SLAM systems to generate posed keyframes and transmit them to remote servers that can perform high-quality 3D reconstruction and visualization at runtime by leveraging recent advances in neural 3D methods. We identify a key challenge with online training where naive image sampling strategies can lead to significant degradation in rendering quality. We propose a novel shifted exponential frame sampling method that addresses this challenge for online training. We demonstrate the effectiveness of our framework in enabling high-quality real-time reconstruction and visualization of unknown scenes as they are captured and streamed from cameras in mobile robots and edge devices.
