ArcPro: Architectural Programs for Structured 3D Abstraction of Sparse Points
Qirui Huang, Runze Zhang, Kangjun Liu, Minglun Gong, Hao Zhang, Hui Huang
TL;DR
ArcPro tackles the challenge of extracting structured 3D abstractions from extremely sparse and noisy architectural point clouds by learning a mapping from points to an architectural program P in a domain-specific language, and then compiling P into a mesh Y via a geometry compiler. The approach uses a 3D CNN encoder and a Transformer decoder to autoregressively predict tokenized programs under a syntax-constrained FSM, with synthetic data generated by procedural DSL-based generation to enable large-scale supervision. It demonstrates superior performance over traditional architectural proxy reconstruction and learning-based abstractions on SfM and sparse-point datasets, while also enabling extensions to multi-view images and natural language retrieval. The combination of architectural priors, programmatic representations, and efficient inference offers a scalable, interpretable pathway for urban modeling and digital twin applications, with potential impact on AR/VR, planning, and language-grounded architecture search.
Abstract
We introduce ArcPro, a novel learning framework built on architectural programs to recover structured 3D abstractions from highly sparse and low-quality point clouds. Specifically, we design a domain-specific language (DSL) to hierarchically represent building structures as a program, which can be efficiently converted into a mesh. We bridge feedforward and inverse procedural modeling by using a feedforward process for training data synthesis, allowing the network to make reverse predictions. We train an encoder-decoder on the points-program pairs to establish a mapping from unstructured point clouds to architectural programs, where a 3D convolutional encoder extracts point cloud features and a transformer decoder autoregressively predicts the programs in a tokenized form. Inference by our method is highly efficient and produces plausible and faithful 3D abstractions. Comprehensive experiments demonstrate that ArcPro outperforms both traditional architectural proxy reconstruction and learning-based abstraction methods. We further explore its potential to work with multi-view image and natural language inputs.
