Prompt-Driven Building Footprint Extraction in Aerial Images with Offset-Building Model
Kai Li, Yupeng Deng, Yunlong Kong, Diyou Liu, Jingbo Chen, Yu Meng, Junxian Ma, Chenhao Wang
TL;DR
This paper tackles building footprint extraction from very-high-resolution aerial imagery by moving to a prompt-based paradigm. It introduces the Offset-Building Model (OBM), which extends Segment Anything Model (SAM) with a Reference Offset Augment Module (ROAM) and a Distance-NMS framework to predict roof segmentation and precise roof-to-footprint offsets. The authors propose a comprehensive prompt-based evaluation framework and demonstrate that OBM achieves superior roof IoU and offset direction accuracy, with notable generalization to new datasets like Huizhou and OmniCity. The work also presents a new Huizhou test set for robust cross-domain validation and shows that prompt-level metrics can better reflect footprint quality in production settings. Overall, OBM, DNMS, and ROAM collectively enable accurate, scalable footprint extraction with reduced human intervention and improved generalization.
Abstract
More accurate extraction of invisible building footprints from very-high-resolution (VHR) aerial images relies on roof segmentation and roof-to-footprint offset extraction. Existing methods based on instance segmentation suffer from poor generalization when extended to large-scale data production and fail to achieve low-cost human interaction. This prompt paradigm inspires us to design a promptable framework for roof and offset extraction, and transforms end-to-end algorithms into promptable methods. Within this framework, we propose a novel Offset-Building Model (OBM). Based on prompt prediction, we first discover a common pattern of predicting offsets and tailored Distance-NMS (DNMS) algorithms for offset optimization. To rigorously evaluate the algorithm's capabilities, we introduce a prompt-based evaluation method, where our model reduces offset errors by 16.6\% and improves roof Intersection over Union (IoU) by 10.8\% compared to other models. Leveraging the common patterns in predicting offsets, DNMS algorithms enable models to further reduce offset vector loss by 6.5\%. To further validate the generalization of models, we tested them using a newly proposed test set, Huizhou test set, with over 7,000 manually annotated instance samples. Our algorithms and dataset will be available at https://github.com/likaiucas/OBM.
