Fast LiDAR Data Generation with Rectified Flows
Kazuto Nakashima, Xiaowen Liu, Tomoya Miyawaki, Yumi Iwashita, Ryo Kurazume
TL;DR
This work addresses the computational cost of diffusion-based LiDAR generation with R2Flow, a rectified-flow model that learns straight trajectories and thereby supports fast, high-fidelity sampling. Sampling uses the ODE $d\mathbf{x}_t/dt = v_\theta(\mathbf{x}_t,t)$ with the linear path $\mathbf{x}_t = t\mathbf{x}_1+(1-t)\mathbf{x}_0$. R2Flow operates in pixel space on 2-channel equirectangular LiDAR images (range and reflectance) and uses a Transformer-based velocity estimator (HDiT) with architectural adjustments for panoramic LiDAR data; timestep distillation and reflow enable few-step sampling. On unconditional generation with the KITTI-360 dataset, the method is compared against GAN and diffusion baselines across multiple fidelity and diversity metrics and shows competitive results with substantially fewer sampling steps. The findings point to R2Flow as a practical LiDAR data prior for restoration, sim-to-real, and domain-adaptation tasks in robotics, and suggest future work on raydrop-aware modeling, sparse-to-dense completion, and anomaly detection.
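To make the ODE and the linear path above concrete, the following is a minimal sketch of Euler sampling, not the authors' implementation. It assumes $\mathbf{x}_0$ is the noise sample and $\mathbf{x}_1$ the data sample (the usual rectified-flow convention), and the names `linear_path`, `euler_sample`, and `ideal_velocity` are hypothetical. The toy velocity field stands in for a trained $v_\theta$: along the linear path the time derivative is the constant $\mathbf{x}_1-\mathbf{x}_0$, so a perfectly straight flow is integrated exactly by a single Euler step.

```python
import numpy as np


def linear_path(x0, x1, t):
    """Point on the straight path x_t = t * x1 + (1 - t) * x0."""
    return t * x1 + (1.0 - t) * x0


def euler_sample(velocity_fn, x0, num_steps):
    """Integrate dx/dt = v(x, t) from t = 0 to t = 1 with uniform Euler steps."""
    x = np.array(x0, dtype=np.float64)
    dt = 1.0 / num_steps
    for i in range(num_steps):
        t = i * dt
        x = x + dt * velocity_fn(x, t)
    return x


# Toy setup (assumption): tiny 2-channel "images" instead of real LiDAR data.
rng = np.random.default_rng(0)
x0 = rng.standard_normal((2, 4, 8))  # noise sample
x1 = rng.standard_normal((2, 4, 8))  # stand-in for a data sample


def ideal_velocity(x, t):
    """Hypothetical perfectly straight field: d(x_t)/dt = x1 - x0 for this pair."""
    return x1 - x0


one_step = euler_sample(ideal_velocity, x0, num_steps=1)
many_steps = euler_sample(ideal_velocity, x0, num_steps=32)
```

With this idealized field, `one_step` and `many_steps` both land on `x1`. A learned velocity field is only approximately straight, which is why the number of steps still matters in practice.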
Abstract
LiDAR generative models hold promise as powerful data priors for restoration, scene manipulation, and scalable simulation in autonomous mobile robots. In recent years, approaches using diffusion models have emerged, significantly improving training stability and generation quality. Despite their success, diffusion models require many neural network evaluations to generate high-quality samples, and the resulting computational cost is a potential barrier for robotics applications. To address this challenge, this paper presents R2Flow, a fast and high-fidelity generative model for LiDAR data. Our method is based on rectified flows that learn straight trajectories, simulating data generation with significantly fewer sampling steps than diffusion models. We also propose an efficient Transformer-based model architecture for processing the image representation of LiDAR range and reflectance measurements. Our experiments on unconditional LiDAR data generation using the KITTI-360 dataset demonstrate the effectiveness of our approach in terms of both efficiency and quality.
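As an illustration of the image representation mentioned above, here is a rough sketch of projecting a point cloud into a 2-channel equirectangular image (range and reflectance). It is not taken from the paper: the function name `points_to_equirectangular`, the 64 x 1024 resolution, and the elevation limits of -25 and +3 degrees are all assumptions chosen for the example, and pixels that receive no point are simply left at zero.

```python
import numpy as np


def points_to_equirectangular(points, reflectance, height=64, width=1024,
                              elev_min_deg=-25.0, elev_max_deg=3.0):
    """Project (N, 3) points into a (2, H, W) range/reflectance image.

    Resolution and elevation limits are illustrative assumptions.
    """
    depth = np.linalg.norm(points, axis=1)
    keep = depth > 0  # drop points at the sensor origin
    points, reflectance, depth = points[keep], reflectance[keep], depth[keep]

    azimuth = np.arctan2(points[:, 1], points[:, 0])   # in [-pi, pi]
    elevation = np.arcsin(points[:, 2] / depth)
    elev_min, elev_max = np.deg2rad(elev_min_deg), np.deg2rad(elev_max_deg)

    # Columns wrap around the full 360 degrees; row 0 is the highest elevation.
    col = np.floor(0.5 * (1.0 - azimuth / np.pi) * width).astype(int) % width
    row = np.floor((elev_max - elevation) / (elev_max - elev_min) * height).astype(int)
    inside = (row >= 0) & (row < height)

    image = np.zeros((2, height, width), dtype=np.float32)
    # Write far-to-near so the nearest point wins when several share a pixel.
    order = np.argsort(-depth[inside])
    r, c = row[inside][order], col[inside][order]
    image[0, r, c] = depth[inside][order]
    image[1, r, c] = reflectance[inside][order]
    return image


# Two points along the same ray plus one degenerate point at the origin.
pts = np.array([[10.0, 0.0, 0.0], [20.0, 0.0, 0.0], [0.0, 0.0, 0.0]])
refl = np.array([0.5, 0.9, 0.1])
img = points_to_equirectangular(pts, refl)
```

In this toy call both valid points fall on the same pixel, and the nearer one (10 m, reflectance 0.5) is kept.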
