EfficientDreamer: High-Fidelity and Robust 3D Creation via Orthogonal-view Diffusion Prior
Zhipeng Hu, Minda Zhao, Chaoyi Zhao, Xinyue Liang, Lincheng Li, Zeng Zhao, Changjie Fan, Xiaowei Zhou, Xin Yu
TL;DR
EfficientDreamer tackles the Janus problem in text-to-3D by introducing an orthogonal-view diffusion prior that renders four mutually consistent orthogonal-view sub-images. A 3D synthesis fusion network then blends this prior with a pre-trained 2D diffusion prior under a dynamic Score Distillation Sampling strategy to progressively favor geometry and then texture. The method is trained on Objaverse-rendered composites and evaluated against state-of-the-art text-to-3D approaches, showing superior geometric consistency, photorealistic textures, and user-preferred results. The two-stage geometry-then-texture optimization, combined with ablation studies, demonstrates the value of orthogonal-view guidance for robust 3D content creation.
Abstract
While image diffusion models have made significant progress in text-driven 3D content creation, they often fail to accurately capture the intended meaning of text prompts, especially for view information. This limitation leads to the Janus problem, where multi-faced 3D models are generated under the guidance of such diffusion models. In this paper, we propose a robust high-quality 3D content generation pipeline by exploiting orthogonal-view image guidance. First, we introduce a novel 2D diffusion model that generates an image consisting of four orthogonal-view sub-images based on the given text prompt. Then, the 3D content is created using this diffusion model. Notably, the generated orthogonal-view image provides strong geometric structure priors and thus improves 3D consistency. As a result, it effectively resolves the Janus problem and significantly enhances the quality of 3D content creation. Additionally, we present a 3D synthesis fusion network that can further improve the details of the generated 3D contents. Both quantitative and qualitative evaluations demonstrate that our method surpasses previous text-to-3D techniques. Project page: https://efficientdreamer.github.io.
