Table of Contents
Fetching ...

WeedsGalore: A Multispectral and Multitemporal UAV-based Dataset for Crop and Weed Segmentation in Agricultural Maize Fields

Ekin Celikkan, Timo Kunzmann, Yertay Yeskaliyev, Sibylle Itzerott, Nadja Klein, Martin Herold

TL;DR

Weed management in maize is hindered by limited, domain-specific, annotated data for UAV-based weed segmentation. We present WeedsGalore, a multispectral, multitemporal UAV dataset with five bands and dense semantic and instance annotations for crops and four weed classes, tailored to maize fields. Baseline experiments with DeepLabv3+ and MaskFormer show that multispectral information improves segmentation, especially for challenging weed classes, and probabilistic inference with MC dropout provides reliable uncertainty estimates and better calibration. The dataset's generalization to unseen data is demonstrated on Maize2024, where models trained on WeedsGalore achieve substantial improvement over baselines, underscoring the practical value for UAV-based weeding and monitoring systems; the dataset and code are publicly available at https://github.com/GFZ/weedsgalore.

Abstract

Weeds are one of the major reasons for crop yield loss but current weeding practices fail to manage weeds in an efficient and targeted manner. Effective weed management is especially important for crops with high worldwide production such as maize, to maximize crop yield for meeting increasing global demands. Advances in near-sensing and computer vision enable the development of new tools for weed management. Specifically, state-of-the-art segmentation models, coupled with novel sensing technologies, can facilitate timely and accurate weeding and monitoring systems. However, learning-based approaches require annotated data and show a lack of generalization to aerial imaging for different crops. We present a novel dataset for semantic and instance segmentation of crops and weeds in agricultural maize fields. The multispectral UAV-based dataset contains images with RGB, red-edge, and near-infrared bands, a large number of plant instances, dense annotations for maize and four weed classes, and is multitemporal. We provide extensive baseline results for both tasks, including probabilistic methods to quantify prediction uncertainty, improve model calibration, and demonstrate the approach's applicability to out-of-distribution data. The results show the effectiveness of the two additional bands compared to RGB only, and better performance in our target domain than models trained on existing datasets. We hope our dataset advances research on methods and operational systems for fine-grained weed identification, enhancing the robustness and applicability of UAV-based weed management. The dataset and code are available at https://github.com/GFZ/weedsgalore

WeedsGalore: A Multispectral and Multitemporal UAV-based Dataset for Crop and Weed Segmentation in Agricultural Maize Fields

TL;DR

Weed management in maize is hindered by limited, domain-specific, annotated data for UAV-based weed segmentation. We present WeedsGalore, a multispectral, multitemporal UAV dataset with five bands and dense semantic and instance annotations for crops and four weed classes, tailored to maize fields. Baseline experiments with DeepLabv3+ and MaskFormer show that multispectral information improves segmentation, especially for challenging weed classes, and probabilistic inference with MC dropout provides reliable uncertainty estimates and better calibration. The dataset's generalization to unseen data is demonstrated on Maize2024, where models trained on WeedsGalore achieve substantial improvement over baselines, underscoring the practical value for UAV-based weeding and monitoring systems; the dataset and code are publicly available at https://github.com/GFZ/weedsgalore.

Abstract

Weeds are one of the major reasons for crop yield loss but current weeding practices fail to manage weeds in an efficient and targeted manner. Effective weed management is especially important for crops with high worldwide production such as maize, to maximize crop yield for meeting increasing global demands. Advances in near-sensing and computer vision enable the development of new tools for weed management. Specifically, state-of-the-art segmentation models, coupled with novel sensing technologies, can facilitate timely and accurate weeding and monitoring systems. However, learning-based approaches require annotated data and show a lack of generalization to aerial imaging for different crops. We present a novel dataset for semantic and instance segmentation of crops and weeds in agricultural maize fields. The multispectral UAV-based dataset contains images with RGB, red-edge, and near-infrared bands, a large number of plant instances, dense annotations for maize and four weed classes, and is multitemporal. We provide extensive baseline results for both tasks, including probabilistic methods to quantify prediction uncertainty, improve model calibration, and demonstrate the approach's applicability to out-of-distribution data. The results show the effectiveness of the two additional bands compared to RGB only, and better performance in our target domain than models trained on existing datasets. We hope our dataset advances research on methods and operational systems for fine-grained weed identification, enhancing the robustness and applicability of UAV-based weed management. The dataset and code are available at https://github.com/GFZ/weedsgalore

Paper Structure

This paper contains 17 sections, 1 equation, 7 figures, 7 tables.

Figures (7)

  • Figure 1: WeedsGalore dataset. We present a novel reference dataset for UAV-based weed monitoring for maize fields. The dataset a) contains images with 5-bands (RGB, near-infrared, red-edge) and b) is recorded at different growth stages, fully reflecting real-world agricultural practices. c) We provide detailed pixel-level annotations for semantic (multiple weed classes) and instance segmentation.
  • Figure 2: Change in plant cover over the acquisition timeline. Examples from four different dates and semantic masks (maize, amaranth, barnyard grass, quickweed, weed other). Best viewed on a colored screen, and zoomed in.
  • Figure 3: Ortohomosaic of the full field and georeferenced locations of annotated images. Points show captured image locations, while their colors encode acquisition dates. The dataset splits are spatially separated by patches into train, validation, and test. Smaller polygons are drawn around the annotated samples: The multitemporal dataset contains samples from the same locations from all dates. Best viewed on screen and zoomed in.
  • Figure 4: Normalized confusion matrices for semantic segmentation. Scores reported for MSI input and DeepLabv3+ model, and uni-weed (left) and multi-weed class cases (right).
  • Figure 5: Qualitative results for probabilistic crop and weed segmentation for MSI input. The uncertainty is high in regions that are misclassified, which is a desired and useful information.
  • ...and 2 more figures