MetaDecorator: Generating Immersive Virtual Tours through Multimodality
Shuang Xie, Yang Liu, Jeannie S. A. Lee, Haiwei Dong
TL;DR
MetaDecorator tackles the limitations of fixed 3D virtual tours by enabling personalized, text-guided decorating of 360° panoramas and subsequent geometry-aware 3D reconstruction. The method combines diffusion-based 2D decoration with ControlNet-guided geometry cues and a DP-NeRF pipeline to produce high-quality, render-efficient polygonal meshes suitable for VR and metaverse applications. A green AI enhancement via DP-NeRF accelerates training by leveraging a depth-informed occupancy grid and depth/RGB constraints, achieving ~10× faster training while maintaining competitive quality. The work also points to future directions involving LLM-driven user interaction and haptic textures to further enrich immersive experiences.
Abstract
MetaDecorator, is a framework that empowers users to personalize virtual spaces. By leveraging text-driven prompts and image synthesis techniques, MetaDecorator adorns static panoramas captured by 360° imaging devices, transforming them into uniquely styled and visually appealing environments. This significantly enhances the realism and engagement of virtual tours compared to traditional offerings. Beyond the core framework, we also discuss the integration of Large Language Models (LLMs) and haptics in the VR application to provide a more immersive experience.
