Detection of Spider Mites on Labrador Beans through Machine Learning Approaches Using Custom Datasets
Violet Liu, Jason Chen, Ans Qureshi, Mahla Nejati
TL;DR
This paper tackles early detection of spider mites on labrador beans by building an RGBN dataset from real greenhouse conditions and evaluating a two-stage pipeline that couples YOLOv8-based leaf segmentation with a sequential CNN classifier. It demonstrates that RGBN input generally improves classification performance and that a two-stage approach effectively handles partially labeled data, outperforming single-stage strategies on occluded datasets. Key contributions include novel RGBN data collection, grid-based segmentation to preserve leaf detail, and an analysis of channel fusion strategies and transfer learning, with the sequential CNN achieving $90.62\%$ validation accuracy on RGBN. The findings suggest that incorporating NIR data can enhance early disease detection and that modular, stage-wise processing improves robustness in practical agricultural settings, though larger datasets and more powerful hardware are needed for broader deployment.
Abstract
Amidst growing food production demands, early plant disease detection is essential to safeguard crops; this study proposes a visual machine learning approach for plant disease detection, harnessing RGB and NIR data collected in real-world conditions through a JAI FS-1600D-10GE camera to build an RGBN dataset. A two-stage early plant disease detection model with YOLOv8 and a sequential CNN was used to train on a dataset with partial labels, which showed a 3.6% increase in mAP compared to a single-stage end-to-end segmentation model. The sequential CNN model achieved 90.62% validation accuracy utilising RGBN data. An average of 6.25% validation accuracy increase is found using RGBN in classification compared to RGB using ResNet15 and the sequential CNN models. Further research and dataset improvements are needed to meet food production demands.
