Table of Contents
Fetching ...

CitDet: A Benchmark Dataset for Citrus Fruit Detection

Jordan A. James, Heather K. Manching, Matthew R. Mattia, Kim D. Bowman, Amanda M. Hulse-Kemp, William J. Beksi

TL;DR

CitDet introduces a high-resolution benchmark dataset for citrus fruit detection in orchards affected by Huanglongbing (HLB), featuring bounding-box annotations for fruit on trees and on the ground across 579 images and 32,448 boxes, plus tree-level yield metadata. The authors benchmark several state-of-the-art detectors on full-resolution and tiled-image variants, finding tiling improves detection performance and that YOLOv7 generally outperforms others, though Faster R-CNN remains strong for small objects. They demonstrate yield estimation from detections using front/back views, reporting substantial correlations with ground-truth yields, especially with the filter-detect-count method (per-tree $R^2$ up to $0.793$). CitDet thus provides a robust resource for robust citrus detection, yield mapping, and HLB impact assessment in real orchard conditions, enabling fairer cross-method comparisons and toward automated harvesting and monitoring.

Abstract

In this letter, we present a new dataset to advance the state of the art in detecting citrus fruit and accurately estimate yield on trees affected by the Huanglongbing (HLB) disease in orchard environments via imaging. Despite the fact that significant progress has been made in solving the fruit detection problem, the lack of publicly available datasets has complicated direct comparison of results. For instance, citrus detection has long been of interest to the agricultural research community, yet there is an absence of work, particularly involving public datasets of citrus affected by HLB. To address this issue, we enhance state-of-the-art object detection methods for use in typical orchard settings. Concretely, we provide high-resolution images of citrus trees located in an area known to be highly affected by HLB, along with high-quality bounding box annotations of citrus fruit. Fruit on both the trees and the ground are labeled to allow for identification of fruit location, which contributes to advancements in yield estimation and potential measure of HLB impact via fruit drop. The dataset consists of over 32,000 bounding box annotations for fruit instances contained in 579 high-resolution images. In summary, our contributions are the following: (i) we introduce a novel dataset along with baseline performance benchmarks on multiple contemporary object detection algorithms, (ii) we show the ability to accurately capture fruit location on tree or on ground, and finally (ii) we present a correlation of our results with yield estimations.

CitDet: A Benchmark Dataset for Citrus Fruit Detection

TL;DR

CitDet introduces a high-resolution benchmark dataset for citrus fruit detection in orchards affected by Huanglongbing (HLB), featuring bounding-box annotations for fruit on trees and on the ground across 579 images and 32,448 boxes, plus tree-level yield metadata. The authors benchmark several state-of-the-art detectors on full-resolution and tiled-image variants, finding tiling improves detection performance and that YOLOv7 generally outperforms others, though Faster R-CNN remains strong for small objects. They demonstrate yield estimation from detections using front/back views, reporting substantial correlations with ground-truth yields, especially with the filter-detect-count method (per-tree up to ). CitDet thus provides a robust resource for robust citrus detection, yield mapping, and HLB impact assessment in real orchard conditions, enabling fairer cross-method comparisons and toward automated harvesting and monitoring.

Abstract

In this letter, we present a new dataset to advance the state of the art in detecting citrus fruit and accurately estimate yield on trees affected by the Huanglongbing (HLB) disease in orchard environments via imaging. Despite the fact that significant progress has been made in solving the fruit detection problem, the lack of publicly available datasets has complicated direct comparison of results. For instance, citrus detection has long been of interest to the agricultural research community, yet there is an absence of work, particularly involving public datasets of citrus affected by HLB. To address this issue, we enhance state-of-the-art object detection methods for use in typical orchard settings. Concretely, we provide high-resolution images of citrus trees located in an area known to be highly affected by HLB, along with high-quality bounding box annotations of citrus fruit. Fruit on both the trees and the ground are labeled to allow for identification of fruit location, which contributes to advancements in yield estimation and potential measure of HLB impact via fruit drop. The dataset consists of over 32,000 bounding box annotations for fruit instances contained in 579 high-resolution images. In summary, our contributions are the following: (i) we introduce a novel dataset along with baseline performance benchmarks on multiple contemporary object detection algorithms, (ii) we show the ability to accurately capture fruit location on tree or on ground, and finally (ii) we present a correlation of our results with yield estimations.
Paper Structure (14 sections, 1 equation, 7 figures, 4 tables)

This paper contains 14 sections, 1 equation, 7 figures, 4 tables.

Figures (7)

  • Figure 1: The CitDet dataset contains precise bounding box object annotations for fruit on tree (purple) and fruit on ground (yellow) (top row). It also has images from multiple different tree rows including a large variety of citrus (bottom row).
  • Figure 2: The area distribution of the objects in CitDet. The dataset is mainly comprised of small object instances with an area of less than 50$^2$ pixels.
  • Figure 3: A comparison of the relative instance size (square root of instance area divided by total image area) between fruit detection datasets.
  • Figure 4: The distribution of object centers in normalized image coordinates for four datasets. CitDet exhibits the greatest spatial diversity among the citrus datasets, with the tiled version exhibiting greater complexity than all the other fruit datasets.
  • Figure 5: Visualizations of the qualitative results for each network on the same full validation image. The bounding box annotations are colored based on the predicted class label with differing color schemes for each network.
  • ...and 2 more figures