Self-Supervised Data Generation for Precision Agriculture: Blending Simulated Environments with Real Imagery
Leonardo Saraceni, Ionut Marian Motoi, Daniele Nardi, Thomas Alessandro Ciarfuglia
TL;DR
The paper tackles the challenge of labeled-data scarcity and covariate shift in precision agriculture by blending real-world segmentation masks with photo-realistic synthetic vineyard imagery. It introduces a Unity-based CANOPIES vineyard simulator to generate synthetic data and uses real masks obtained via a YOLOv5-based detector and SAM, then pastes them onto synthetic images with PCA-guided alignment to create diverse, labeled samples. Empirical results on grape detection show that blending real instances into synthetic scenes (SyntheticPasted) and combining with pseudo-labeled real data yields the best performance, improving robustness to occlusions and illumination changes. The approach offers an automated, scalable data-generation pipeline suitable for adoption by farmers and adaptable to other crops.
Abstract
In precision agriculture, the scarcity of labeled data and significant covariate shifts pose unique challenges for training machine learning models. This scarcity is particularly problematic due to the dynamic nature of the environment and the evolving appearance of agricultural subjects as living things. We propose a novel system for generating realistic synthetic data to address these challenges. Utilizing a vineyard simulator based on the Unity engine, our system employs a cut-and-paste technique with geometrical consistency considerations to produce accurate photo-realistic images and labels from synthetic environments to train detection algorithms. This approach generates diverse data samples across various viewpoints and lighting conditions. We demonstrate considerable performance improvements in training a state-of-the-art detector by applying our method to table grapes cultivation. The combination of techniques can be easily automated, an increasingly important consideration for adoption in agricultural practice.
