AnyHome: Open-Vocabulary Generation of Structured and Textured 3D Homes
Rao Fu, Zehao Wen, Zichen Liu, Srinath Sridhar
TL;DR
AnyHome presents a two-stage, text-controlled pipeline that translates open-vocabulary text into house-scale 3D indoor scenes with structured geometry and textured realism. By combining LLM-driven modular descriptions, amodal hierarchical geometry, a graph-based floorplan and room-layout generation, SDS-based refinement, and egocentric texture inpainting, the method achieves robust open-vocabulary generation and editing while maintaining structural coherence. The approach demonstrates strong improvements over baselines in layout quality and texture consistency, enabling diverse, editable interiors for interior design, gaming, AR/VR, and embodied-agent training. This work advances open-vocabulary 3D scene synthesis by integrating language-driven planning, graph-based control, and view-consistent texture generation, paving the way for scalable, richly textured, navigable indoor environments.
Abstract
Inspired by cognitive theories, we introduce AnyHome, a framework that translates any text into well-structured and textured indoor scenes at a house-scale. By prompting Large Language Models (LLMs) with designed templates, our approach converts provided textual narratives into amodal structured representations. These representations guarantee consistent and realistic spatial layouts by directing the synthesis of a geometry mesh within defined constraints. A Score Distillation Sampling process is then employed to refine the geometry, followed by an egocentric inpainting process that adds lifelike textures to it. AnyHome stands out with its editability, customizability, diversity, and realism. The structured representations for scenes allow for extensive editing at varying levels of granularity. Capable of interpreting texts ranging from simple labels to detailed narratives, AnyHome generates detailed geometries and textures that outperform existing methods in both quantitative and qualitative measures.
