Table of Contents
Fetching ...

Leafy Spurge Dataset: Real-world Weed Classification Within Aerial Drone Imagery

Kyle Doherty, Max Gurinas, Erik Samsoe, Charles Casper, Beau Larkin, Philip Ramsey, Brandon Trabucco, Ruslan Salakhutdinov

TL;DR

The Leafy Spurge Dataset introduces a real-world, drone-imagery benchmark for presence/absence classification of the invasive weed Euphorbia esula in western Montana. It combines high-resolution aerial imagery (8241 images over 118 ha) with precise ground-truth labels and benchmarks two backbones, ResNet50 and DINOv2, on two image scales, revealing that small crops yield strong results (0.84 accuracy) and vision transformers excel with larger context (0.74 accuracy). The dataset demonstrates tractability but not full solvability for weed classification in wildlands and is released to advance ecology, remote sensing, and foundation-model research, including few-shot and unsupervised learning with unlabelled data. This resource supports development of targeted weed management and promotes exploration of multi-modal and foundation-model-driven approaches in real-world remote sensing tasks.

Abstract

Invasive plant species are detrimental to the ecology of both agricultural and wildland areas. Euphorbia esula, or leafy spurge, is one such plant that has spread through much of North America from Eastern Europe. When paired with contemporary computer vision systems, unmanned aerial vehicles, or drones, offer the means to track expansion of problem plants, such as leafy spurge, and improve chances of controlling these weeds. We gathered a dataset of leafy spurge presence and absence in grasslands of western Montana, USA, then surveyed these areas with a commercial drone. We trained image classifiers on these data, and our best performing model, a pre-trained DINOv2 vision transformer, identified leafy spurge with 0.84 accuracy (test set). This result indicates that classification of leafy spurge is tractable, but not solved. We release this unique dataset of labelled and unlabelled, aerial drone imagery for the machine learning community to explore. Improving classification performance of leafy spurge would benefit the fields of ecology, conservation, and remote sensing alike. Code and data are available at our website: leafy-spurge-dataset.github.io.

Leafy Spurge Dataset: Real-world Weed Classification Within Aerial Drone Imagery

TL;DR

The Leafy Spurge Dataset introduces a real-world, drone-imagery benchmark for presence/absence classification of the invasive weed Euphorbia esula in western Montana. It combines high-resolution aerial imagery (8241 images over 118 ha) with precise ground-truth labels and benchmarks two backbones, ResNet50 and DINOv2, on two image scales, revealing that small crops yield strong results (0.84 accuracy) and vision transformers excel with larger context (0.74 accuracy). The dataset demonstrates tractability but not full solvability for weed classification in wildlands and is released to advance ecology, remote sensing, and foundation-model research, including few-shot and unsupervised learning with unlabelled data. This resource supports development of targeted weed management and promotes exploration of multi-modal and foundation-model-driven approaches in real-world remote sensing tasks.

Abstract

Invasive plant species are detrimental to the ecology of both agricultural and wildland areas. Euphorbia esula, or leafy spurge, is one such plant that has spread through much of North America from Eastern Europe. When paired with contemporary computer vision systems, unmanned aerial vehicles, or drones, offer the means to track expansion of problem plants, such as leafy spurge, and improve chances of controlling these weeds. We gathered a dataset of leafy spurge presence and absence in grasslands of western Montana, USA, then surveyed these areas with a commercial drone. We trained image classifiers on these data, and our best performing model, a pre-trained DINOv2 vision transformer, identified leafy spurge with 0.84 accuracy (test set). This result indicates that classification of leafy spurge is tractable, but not solved. We release this unique dataset of labelled and unlabelled, aerial drone imagery for the machine learning community to explore. Improving classification performance of leafy spurge would benefit the fields of ecology, conservation, and remote sensing alike. Code and data are available at our website: leafy-spurge-dataset.github.io.
Paper Structure (11 sections, 4 figures, 1 table)

This paper contains 11 sections, 4 figures, 1 table.

Figures (4)

  • Figure 1: A sagebrush community invaded by leafy spurge (left) and a closeup of a leafy spurge inflorescence (right).
  • Figure 2: Left: A map of the drone survey flight plan where each point represents the coordinates of a photo in the survey, colored by time of photo. Right: These overlapping images are processed into a large, georeferenced orthomosaic (right).
  • Figure 3: Left: Locations of leafy spurge ground truth samples. Point colors indicate sampling site identity. Right: The red box indicates the dimensions of a leafy spurge ground truth sample (0.5 x 0.5m).
  • Figure 4: We present classification results for the Leafy Spurge dataset when training on 39x39 pixel (left panels) and 1024x1024 pixel (right panels) images. We tested two architectures: ResNet50 (orange) and DINOv2 (blue). We report training (A) and validation (B) set accuracy. We represent cross-validated means as lines and 95% confidence intervals as bands.