Map Imagination Like Blind Humans: Group Diffusion Model for Robotic Map Generation
Qijin Song, Weibang Bai
TL;DR
This work tackles the challenge of building large-scale 3D point cloud maps from severely limited perception, inspired by blind humans' mental cartography. It introduces a Group Diffusion Model (GDM) that partitions a map into groups and applies diffusion-denoising within each group, using a two-stage process: Stage 1 generates central points from path data, and Stage 2 performs group-wise denoising to yield a detailed map. The method yields reasonable maps from path data alone and further benefits from sparse LiDAR cues, significantly reducing sensor dependency compared to traditional LiDAR/vision-based approaches. Practically, this enables robots to imagine and generate basic maps with minimal onboard sensing, potentially supporting navigation and planning in sensor-constrained scenarios. The approach combines diffusion theory with a sparse Unet backbone and demonstrates robustness across shapes and large-scale extents.
Abstract
Can robots imagine or generate maps like humans do, especially when only limited information can be perceived like blind people? To address this challenging task, we propose a novel group diffusion model (GDM) based architecture for robots to generate point cloud maps with very limited input information.Inspired from the blind humans' natural capability of imagining or generating mental maps, the proposed method can generate maps without visual perception data or depth data. With additional limited super-sparse spatial positioning data, like the extra contact-based positioning information the blind individuals can obtain, the map generation quality can be improved even more.Experiments on public datasets are conducted, and the results indicate that our method can generate reasonable maps solely based on path data, and produce even more refined maps upon incorporating exiguous LiDAR data.Compared to conventional mapping approaches, our novel method significantly mitigates sensor dependency, enabling the robots to imagine and generate elementary maps without heavy onboard sensory devices.
