FacaDiffy: Inpainting Unseen Facade Parts Using Diffusion Models
Thomas Froech, Olaf Wysocki, Yan Xia, Junyu Xie, Benedikt Schwab, Daniel Cremers, Thomas H. Kolbe
TL;DR
FacaDiffy tackles the problem of incomplete 2D facade conflict maps by combining a deterministic ray-casting pipeline with a personalized diffusion-based inpainting approach. It first computes conflict maps from existing 3D models and laser scans, then augments training data with synthetic conflict maps from random city models to personalize a Stable Diffusion inpainting model using DreamBooth. The method demonstrates state-of-the-art performance in conflict-map completion, yields notable improvements in LoD3 reconstruction detection rates, and offers a scalable path for deploying facade inpainting in real-world city-model pipelines. The work provides a practical workflow for enhancing semantic 3D building reconstruction through targeted, dataset-efficient personalization of generative models.
Abstract
High-detail semantic 3D building models are frequently utilized in robotics, geoinformatics, and computer vision. One key aspect of creating such models is employing 2D conflict maps that detect openings' locations in building facades. Yet, in reality, these maps are often incomplete due to obstacles encountered during laser scanning. To address this challenge, we introduce FacaDiffy, a novel method for inpainting unseen facade parts by completing conflict maps with a personalized Stable Diffusion model. Specifically, we first propose a deterministic ray analysis approach to derive 2D conflict maps from existing 3D building models and corresponding laser scanning point clouds. Furthermore, we facilitate the inpainting of unseen facade objects into these 2D conflict maps by leveraging the potential of personalizing a Stable Diffusion model. To complement the scarcity of real-world training data, we also develop a scalable pipeline to produce synthetic conflict maps using random city model generators and annotated facade images. Extensive experiments demonstrate that FacaDiffy achieves state-of-the-art performance in conflict map completion compared to various inpainting baselines and increases the detection rate by $22\%$ when applying the completed conflict maps for high-definition 3D semantic building reconstruction. The code is be publicly available in the corresponding GitHub repository: https://github.com/ThomasFroech/InpaintingofUnseenFacadeObjects
