Robust soybean seed yield estimation using high-throughput ground robot videos
Jiale Feng, Samuel W. Blair, Timilehin Ayanlade, Aditya Balu, Baskar Ganapathysubramanian, Arti Singh, Soumik Sarkar, Asheesh K Singh
TL;DR
This work tackles the high cost and labor intensity of soybean yield data collection by leveraging ground-robot video data and deep learning for seed counting and yield estimation. It introduces P2PNet-Yield, a two-stage architecture that uses a P2PNet-Soy backbone for seed counting and a regression head to estimate plot yield, trained on a three-year, large-field dataset collected with fisheye cameras. Key innovations include fisheye distortion correction, data augmentation with camera sensor effects, and spatial adjustment to account for environmental variation, achieving strong genotype ranking performance and notable time/cost reductions. The approach demonstrates practical utility for plant breeding programs, enabling scalable, high-throughput yield estimation and ranking, with potential integration into broader phenotyping pipelines and field networks.
Abstract
We present a novel method for soybean (Glycine max (L.) Merr.) yield estimation leveraging high throughput seed counting via computer vision and deep learning techniques. Traditional methods for collecting yield data are labor-intensive, costly, prone to equipment failures at critical data collection times, and require transportation of equipment across field sites. Computer vision, the field of teaching computers to interpret visual data, allows us to extract detailed yield information directly from images. By treating it as a computer vision task, we report a more efficient alternative, employing a ground robot equipped with fisheye cameras to capture comprehensive videos of soybean plots from which images are extracted in a variety of development programs. These images are processed through the P2PNet-Yield model, a deep learning framework where we combined a Feature Extraction Module (the backbone of the P2PNet-Soy) and a Yield Regression Module to estimate seed yields of soybean plots. Our results are built on three years of yield testing plot data - 8500 in 2021, 2275 in 2022, and 650 in 2023. With these datasets, our approach incorporates several innovations to further improve the accuracy and generalizability of the seed counting and yield estimation architecture, such as the fisheye image correction and data augmentation with random sensor effects. The P2PNet-Yield model achieved a genotype ranking accuracy score of up to 83%. It demonstrates up to a 32% reduction in time to collect yield data as well as costs associated with traditional yield estimation, offering a scalable solution for breeding programs and agricultural productivity enhancement.
