Walnut Detection Through Deep Learning Enhanced by Multispectral Synthetic Images
Kaiming Fu, Tong Lei, Maryia Halubok, Brian N. Bailey
TL;DR
The paper addresses the challenge of accurately detecting walnuts in orchards, where walnuts and leaves look highly similar in both RGB and NIR images. It augments real data with synthetic images generated by the radiative-transfer-based Helios framework, uses reverse ray tracing to label the synthetic pixels, and trains YOLOv5 on the augmented RGB and NIR datasets. Detection metrics improve substantially for both RGB (AP from 73.89% to 82.68%; F1 from 72.31% to 80.56%) and NIR (AP from 68.17% to 78.63%; F1 from 62.07% to 74.48%), demonstrating the value of synthetic data in agricultural image analysis. The authors suggest two directions for future work: developing a unified RGB-NIR model, and scaling up synthetic data to reduce dependence on real images. Both have practical implications for yield estimation and orchard management.
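To make the size of the reported gains explicit, the minimal sketch below computes the absolute and relative change for each metric pair quoted above. Only the eight numbers come from the text; the `REPORTED` layout and the `gains` helper are illustrative assumptions and are not part of the paper's code.

```python
# Metric values as quoted in the TL;DR (before, after adding synthetic images).
# The dictionary layout and helper name are illustrative, not from the paper.
REPORTED = {
    ("RGB", "AP"): (73.89, 82.68),
    ("RGB", "F1"): (72.31, 80.56),
    ("NIR", "AP"): (68.17, 78.63),
    ("NIR", "F1"): (62.07, 74.48),
}


def gains(before: float, after: float) -> tuple[float, float]:
    """Return (absolute gain in points, relative gain in percent of the baseline)."""
    absolute = after - before
    relative = 100.0 * absolute / before
    return round(absolute, 2), round(relative, 2)


if __name__ == "__main__":
    for (modality, metric), (before, after) in REPORTED.items():
        absolute, relative = gains(before, after)
        print(f"{modality} {metric}: +{absolute:.2f} points ({relative:.1f}% relative)")
```

On these figures, the NIR model gains more in absolute terms (about 10.5 AP points and 12.4 F1 points) than the RGB model (about 8.8 and 8.3).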
Abstract
Accurately identifying walnuts within orchards offers many advantages and can substantially improve the efficiency and productivity of walnut orchard management. However, walnuts and leaves closely resemble each other in shape, color, and texture, which makes it difficult to distinguish them precisely during annotation. In this study, we present a novel approach to improving walnut detection that uses YOLOv5 trained on an enriched image set incorporating both real and synthetic RGB and NIR images. A comparison of results from the original and augmented datasets shows clear improvements in detection when the synthetic images are used.
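As a rough illustration of what an image set "incorporating both real and synthetic" samples might look like in practice, the sketch below merges two lists of image paths into one shuffled training list while recording where each sample came from. This is a hypothetical helper, not the authors' pipeline: the function name, the `(path, source)` tuple layout, the example file names, and the assumption that one such list would be built per modality (RGB or NIR) are all illustrative.

```python
import random


def build_augmented_set(real_images, synthetic_images, seed=0):
    """Combine real and synthetic samples into one shuffled training list.

    Each returned item is a (path, source) tuple, so the origin of every
    sample stays traceable after shuffling. Hypothetical helper for
    illustration only.
    """
    combined = [(path, "real") for path in real_images]
    combined += [(path, "synthetic") for path in synthetic_images]
    # A seeded generator keeps the shuffle reproducible across runs.
    random.Random(seed).shuffle(combined)
    return combined


if __name__ == "__main__":
    # Placeholder file names; real paths would point at the labeled images.
    real = ["real_001.png", "real_002.png"]
    synthetic = ["synth_001.png", "synth_002.png", "synth_003.png"]
    for path, source in build_augmented_set(real, synthetic):
        print(source, path)
```

Keeping the source tag alongside each path makes it straightforward to evaluate on real images only, which is one way to check that gains from synthetic data carry over to field conditions.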
