Enhancing weed detection performance by means of GenAI-based image augmentation
Sourav Modak, Anthony Stein
TL;DR
This work addresses the data scarcity hindering robust real-time weed detection on edge devices by introducing GenAI-based image augmentation using a SAM-guided Stable Diffusion pipeline to generate diverse synthetic weed-crop scenes. The approach is evaluated on compact YOLO variants (YOLOv8n, YOLOv9t, YOLOv10-N) with and without COCO pretraining, showing improvements in $mAP_{50}$ and $mAP_{50-95}$ across models; notably, larger gains occur when training from scratch (e.g., $mAP_{50}$ gains of about 20–30 percentage points for some models). Traditional augmentations remain valuable, but synthetic data provides broader diversity and better generalization in heterogeneous field conditions, albeit with higher compute and quality-control requirements. The findings demonstrate the potential of synthetic data to reduce real-data annotation needs and enhance edge-ready weed detection, informing future hybrid augmentation strategies and efficiency optimizations for embedded deployment.
Abstract
Precise weed management is essential for sustaining crop productivity and ecological balance. Traditional herbicide applications face economic and environmental challenges, emphasizing the need for intelligent weed control systems powered by deep learning. These systems require vast amounts of high-quality training data. The reality of scarcity of well-annotated training data, however, is often addressed through generating more data using data augmentation. Nevertheless, conventional augmentation techniques such as random flipping, color changes, and blurring lack sufficient fidelity and diversity. This paper investigates a generative AI-based augmentation technique that uses the Stable Diffusion model to produce diverse synthetic images that improve the quantity and quality of training datasets for weed detection models. Moreover, this paper explores the impact of these synthetic images on the performance of real-time detection systems, thus focusing on compact CNN-based models such as YOLO nano for edge devices. The experimental results show substantial improvements in mean Average Precision (mAP50 and mAP50-95) scores for YOLO models trained with generative AI-augmented datasets, demonstrating the promising potential of synthetic data to enhance model robustness and accuracy.
