Generating 3D House Wireframes with Semantics
Xueqi Ma, Yilin Liu, Wenjun Zhou, Ruowei Wang, Hui Huang
TL;DR
This work tackles unconditional generation of semantically enriched 3D house wireframes. It introduces a wire-based representation and semantic sequencing, coupled with a two-stage pipeline: a graph-based autoencoder learns a quantized geometric vocabulary and a transformer decoder autoregressively generates semantically ordered wire segments via BFS grouping. Key innovations include Local Multi-Head Attention in the encoder, Residual LFQ quantization, and a coarse-to-fine transformer to produce coherent line and vertex embeddings. Experimental results on a newly created 3D house wireframe dataset show superior accuracy, novelty, and semantic fidelity compared to baselines, with qualitative analyses and user studies supporting practical usefulness for CAD and VR applications.
Abstract
We present a new approach for generating 3D house wireframes with semantic enrichment using an autoregressive model. Unlike conventional generative models that independently process vertices, edges, and faces, our approach employs a unified wire-based representation for improved coherence in learning 3D wireframe structures. By re-ordering wire sequences based on semantic meanings, we facilitate seamless semantic integration during sequence generation. Our two-phase technique merges a graph-based autoencoder with a transformer-based decoder to learn latent geometric tokens and generate semantic-aware wireframes. Through iterative prediction and decoding during inference, our model produces detailed wireframes that can be easily segmented into distinct components, such as walls, roofs, and rooms, reflecting the semantic essence of the shape. Empirical results on a comprehensive house dataset validate the superior accuracy, novelty, and semantic fidelity of our model compared to existing generative models. More results and details can be found on https://vcc.tech/research/2024/3DWire.
