Object-Centric 3D Gaussian Splatting for Strawberry Plant Reconstruction and Phenotyping
Jiajia Li, Keyi Zhu, Qianwen Zhang, Dong Chen, Qi Sun, Zhaojian Li
TL;DR
This work tackles the challenge of non-destructive, high-throughput strawberry phenotyping by developing an object-centric 3D Gaussian Splatting (3DGS) framework that uses SAM-2 foreground masks to suppress background noise during reconstruction. The pipeline integrates RGBA-based loss masking, opacity-guided Gaussian culling, and background randomization, with scale calibration via a known calibration cube and trait extraction through DBSCAN clustering and PCA to obtain plant height and canopy width. Compared with NeRF-based baselines, the method delivers superior reconstruction quality (e.g., PSNR and SSIM) while reducing training time and memory usage, and enables centimeter-level accuracy in key traits, facilitating automated, scalable phenotyping. The approach holds significant practical impact for agricultural monitoring and breeding, with potential extensions to other crops and field-scale deployments through hierarchical or distributed reconstruction strategies.
Abstract
Strawberries are among the most economically significant fruits in the United States, generating over $2 billion in annual farm-gate sales and accounting for approximately 13% of the total fruit production value. Plant phenotyping plays a vital role in selecting superior cultivars by characterizing plant traits such as morphology, canopy structure, and growth dynamics. However, traditional plant phenotyping methods are time-consuming, labor-intensive, and often destructive. Recently, neural rendering techniques, notably Neural Radiance Fields (NeRF) and 3D Gaussian Splatting (3DGS), have emerged as powerful frameworks for high-fidelity 3D reconstruction. By capturing a sequence of multi-view images or videos around a target plant, these methods enable non-destructive reconstruction of complex plant architectures. Despite their promise, most current applications of 3DGS in agricultural domains reconstruct the entire scene, including background elements, which introduces noise, increases computational costs, and complicates downstream trait analysis. To address this limitation, we propose a novel object-centric 3D reconstruction framework incorporating a preprocessing pipeline that leverages the Segment Anything Model v2 (SAM-2) and alpha channel background masking to achieve clean strawberry plant reconstructions. This approach produces more accurate geometric representations while substantially reducing computational time. With a background-free reconstruction, our algorithm can automatically estimate important plant traits, such as plant height and canopy width, using DBSCAN clustering and Principal Component Analysis (PCA). Experimental results show that our method outperforms conventional pipelines in both accuracy and efficiency, offering a scalable and non-destructive solution for strawberry plant phenotyping.
