LiDAR-EDIT: LiDAR Data Generation by Editing the Object Layouts in Real-World Scenes

Shing-Hei Ho; Bao Thach; Minghan Zhu

LiDAR-EDIT: LiDAR Data Generation by Editing the Object Layouts in Real-World Scenes

Shing-Hei Ho, Bao Thach, Minghan Zhu

TL;DR

LiDAR-EDIT addresses generating controllable, realistic synthetic LiDAR data by editing object layouts within real-world scans while preserving the background. Formally, the target scan is $P_T = f(S, D_T)$ given a background $S$ and edited foreground $D_T$, decomposed into object removal, point-cloud completion, and object insertion. The method introduces spherical voxelization with coordinates $(r, theta, phi)$ to mirror LiDAR rays and enforce consistent occlusion and density, plus a two-stage background inpainting pipeline inspired by UltraLidar. An object library of full-shape point clouds is constructed and completion is performed with AnchorFormer to enable arbitrary insertions. Experiments on nuScenes-LidarSeg demonstrate realistic edits with small domain gaps and show synthetic pretraining can improve downstream detection, highlighting practical impact for autonomous driving.

Abstract

We present LiDAR-EDIT, a novel paradigm for generating synthetic LiDAR data for autonomous driving. Our framework edits real-world LiDAR scans by introducing new object layouts while preserving the realism of the background environment. Compared to end-to-end frameworks that generate LiDAR point clouds from scratch, LiDAR-EDIT offers users full control over the object layout, including the number, type, and pose of objects, while keeping most of the original real-world background. Our method also provides object labels for the generated data. Compared to novel view synthesis techniques, our framework allows for the creation of counterfactual scenarios with object layouts significantly different from the original real-world scene. LiDAR-EDIT uses spherical voxelization to enforce correct LiDAR projective geometry in the generated point clouds by construction. During object removal and insertion, generative models are employed to fill the unseen background and object parts that were occluded in the original real LiDAR scans. Experimental results demonstrate that our framework produces realistic LiDAR scans with practical value for downstream tasks.

LiDAR-EDIT: LiDAR Data Generation by Editing the Object Layouts in Real-World Scenes

TL;DR

LiDAR-EDIT addresses generating controllable, realistic synthetic LiDAR data by editing object layouts within real-world scans while preserving the background. Formally, the target scan is

given a background

and edited foreground

, decomposed into object removal, point-cloud completion, and object insertion. The method introduces spherical voxelization with coordinates

to mirror LiDAR rays and enforce consistent occlusion and density, plus a two-stage background inpainting pipeline inspired by UltraLidar. An object library of full-shape point clouds is constructed and completion is performed with AnchorFormer to enable arbitrary insertions. Experiments on nuScenes-LidarSeg demonstrate realistic edits with small domain gaps and show synthetic pretraining can improve downstream detection, highlighting practical impact for autonomous driving.

LiDAR-EDIT: LiDAR Data Generation by Editing the Object Layouts in Real-World Scenes

TL;DR

Abstract

LiDAR-EDIT: LiDAR Data Generation by Editing the Object Layouts in Real-World Scenes

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (7)