Beyond the Frontier: Predicting Unseen Walls from Occupancy Grids by Learning from Floor Plans
Ludvig Ericson, Patric Jensfelt
TL;DR
This work addresses predicting unseen indoor walls from partial occupancy grids captured by a 360° LIDAR, framing the task as autoregressive prediction of wall-vertex sequences conditioned on sensor history and floor-plan priors. The proposed Floorist model uses a transformer encoder–decoder with cross-attention, encoding occupancy grids via ViT and visible-wall tokens via discrete embeddings, and generates wall segments as a sequence of tokens with a robust axis-aligned, subdivided representation. Data is synthesized by simulating robot trajectories on KTH floor plans, including axis alignment and segment subdivision, with evaluation via predicted information gain in frontier-based exploration and extensive ablations against a non-predictive baseline and an image-based predictor. The results show Floorist outperforms baselines in predicting information gain and wall geometry, scales with sensor range and grid area, and generalizes to a real-world office environment, highlighting potential for real-time floor-plan inference and improved exploration strategies. The work also provides open-source data and methodology to advance map-predictive exploration research.
Abstract
In this paper, we tackle the challenge of predicting the unseen walls of a partially observed environment as a set of 2D line segments, conditioned on occupancy grids integrated along the trajectory of a 360° LIDAR sensor. A dataset of such occupancy grids and their corresponding target wall segments is collected by navigating a virtual robot between a set of randomly sampled waypoints in a collection of office-scale floor plans from a university campus. The line segment prediction task is formulated as an autoregressive sequence prediction task, and an attention-based deep network is trained on the dataset. The sequence-based autoregressive formulation is evaluated through predicted information gain, as in frontier-based autonomous exploration, demonstrating significant improvements over both non-predictive estimation and convolution-based image prediction found in the literature. Ablations on key components are evaluated, as well as sensor range and the occupancy grid's metric area. Finally, model generality is validated by predicting walls in a novel floor plan reconstructed on-the-fly in a real-world office environment.
