La La LiDAR: Large-Scale Layout Generation from LiDAR Data
Youquan Liu, Lingdong Kong, Weidong Yang, Xin Li, Ao Liang, Runnan Chen, Ben Fei, Tongliang Liu
TL;DR
La La LiDAR tackles the need for controllable 3D LiDAR scene generation by introducing a layout-guided diffusion framework that explicitly models foreground object relations through scene graphs. The two-stage approach first generates semantically-consistent layouts via a scene-graph diffusion process, then synthesizes foreground point clouds and completes the full scene with a Foreground-aware Control Injector that conditions background generation on foreground structure. The work provides two large-scale LiDAR scene graph datasets (Waymo-SG and nuScenes-SG) and new evaluation metrics, and demonstrates state-of-the-art performance in LiDAR layout fidelity, scene realism, and downstream perception tasks such as segmentation, object detection, and completion. This approach enables fine-grained, relation-aware control over driving scenarios, with strong implications for autonomous driving simulation, safety validation, and data augmentation.
Abstract
Controllable generation of realistic LiDAR scenes is crucial for applications such as autonomous driving and robotics. While recent diffusion-based models achieve high-fidelity LiDAR generation, they lack explicit control over foreground objects and spatial relationships, limiting their usefulness for scenario simulation and safety validation. To address these limitations, we propose Large-scale Layout-guided LiDAR generation model ("La La LiDAR"), a novel layout-guided generative framework that introduces semantic-enhanced scene graph diffusion with relation-aware contextual conditioning for structured LiDAR layout generation, followed by foreground-aware control injection for complete scene generation. This enables customizable control over object placement while ensuring spatial and semantic consistency. To support our structured LiDAR generation, we introduce Waymo-SG and nuScenes-SG, two large-scale LiDAR scene graph datasets, along with new evaluation metrics for layout synthesis. Extensive experiments demonstrate that La La LiDAR achieves state-of-the-art performance in both LiDAR generation and downstream perception tasks, establishing a new benchmark for controllable 3D scene generation.
