3DBonsai: Structure-Aware Bonsai Modeling Using Conditioned 3D Gaussian Splatting
Hao Wu, Hao Wang, Ruochong Li, Xuran Ma, Hui Xiong
TL;DR
3DBonsai tackles the problem of generating complex 3D bonsai by introducing a trainable 3D Space Colonization Algorithm to create structure priors and two structure-aware 3D Gaussian Splatting pipelines (fine-structure and coarse-structure) guided by 2D diffusion. The method couples 3D priors with SDS-based text guidance and depth-informed multi-view consistency, enabling high-fidelity, structure-aware 3D bonsai generation. A Chinese-style bonsai dataset supports training and benchmarking, and extensive experiments show improvements in both perceptual quality and 3D consistency, with favorable CLIP scores and human preferences. The work establishes a new benchmark for structure-aware 3D bonsai generation and highlights remaining challenges in rendering extremely complex forms due to view-limited diffusion guidance.
Abstract
Recent advancements in text-to-3D generation have shown remarkable results by leveraging 3D priors in combination with 2D diffusion. However, previous methods utilize 3D priors that lack detailed and complex structural information, limiting them to generating simple objects and presenting challenges for creating intricate structures such as bonsai. In this paper, we propose 3DBonsai, a novel text-to-3D framework for generating 3D bonsai with complex structures. Technically, we first design a trainable 3D space colonization algorithm to produce bonsai structures, which are then enhanced through random sampling and point cloud augmentation to serve as the 3D Gaussian priors. We introduce two bonsai generation pipelines with distinct structural levels: fine structure conditioned generation, which initializes 3D Gaussians using a 3D structure prior to produce detailed and complex bonsai, and coarse structure conditioned generation, which employs a multi-view structure consistency module to align 2D and 3D structures. Moreover, we have compiled a unified 2D and 3D Chinese-style bonsai dataset. Our experimental results demonstrate that 3DBonsai significantly outperforms existing methods, providing a new benchmark for structure-aware 3D bonsai generation.
