Tree-D Fusion: Simulation-Ready Tree Dataset from Single Images with Diffusion Priors
Jae Joong Lee, Bosheng Li, Sara Beery, Jonathan Huang, Songlin Fei, Raymond A. Yeh, Bedrich Benes
TL;DR
Tree-D Fusion tackles the scarcity of large-scale, realistic 3D tree data by reconstructing simulation-ready trees from single images using genus-conditioned diffusion priors and a developmental space-colonization model. It trains 2D priors on Auto Arborist images and a 3D prior on synthetic trees to produce a detailed 3D envelope, which is expanded into a full branching structure. The approach achieves state-of-the-art realism and geometric fidelity across 600k models, enabling scalable forestry analysis, urban planning, and AR visualization, while maintaining the ability to simulate growth and environmental interactions. Limitations include sensitivity to asymmetric shapes and leaf occlusion, with future work targeting broader genus coverage and improved occlusion handling.
Abstract
We introduce Tree D-fusion, featuring the first collection of 600,000 environmentally aware, 3D simulation-ready tree models generated through Diffusion priors. Each reconstructed 3D tree model corresponds to an image from Google's Auto Arborist Dataset, comprising street view images and associated genus labels of trees across North America. Our method distills the scores of two tree-adapted diffusion models by utilizing text prompts to specify a tree genus, thus facilitating shape reconstruction. This process involves reconstructing a 3D tree envelope filled with point markers, which are subsequently utilized to estimate the tree's branching structure using the space colonization algorithm conditioned on a specified genus.
