Survey on Datasets for Perception in Unstructured Outdoor Environments
Peter Mortimer, Mirko Maehlisch
TL;DR
This survey maps the landscape of publicly available perception datasets for unstructured outdoor environments in field robotics, focusing on 2D/3D sensing, synthetic data, terrain, forestry, place recognition, and robot platforms. It analyzes dataset characteristics, annotation policies, and sensor layouts to understand biases and scalability, highlighting trends toward larger, more granular annotations and the scarcity of cross-task benchmarks. The authors discuss the need for standardized evaluation servers and ontologies to harmonize labels across datasets, along with the potential of synthetic data and foundation models to reduce labeling effort. The work serves as a practical guide for practitioners to select appropriate datasets and identifies opportunities to improve interoperability and reproducibility in field robotics research.
Abstract
Perception is an essential component of pipelines in field robotics. In this survey, we quantitatively compare publicly available datasets available in unstructured outdoor environments. We focus on datasets for common perception tasks in field robotics. Our survey categorizes and compares available research datasets. This survey also reports on relevant dataset characteristics to help practitioners determine which dataset fits best for their own application. We believe more consideration should be taken in choosing compatible annotation policies across the datasets in unstructured outdoor environments.
