CoPeD-Advancing Multi-Robot Collaborative Perception: A Comprehensive Dataset in Real-World Environments
Yang Zhou, Long Quang, Carlos Nieto-Granda, Giuseppe Loianno
TL;DR
CoPeD addresses the lack of real-world multi-robot collaborative perception data by providing a heterogeneous air-ground dataset with multiple modalities and synchronized pose data. It introduces a comprehensive hardware setup, indoor and outdoor sequences, calibration pipelines, and zero-shot annotation pipelines to support a range of tasks from depth/semantic estimation to sensor fusion. The dataset enables robust cross-robot collaboration studies and goes beyond BEV-focused scenarios by leveraging air-ground viewpoints. This resource is positioned to accelerate research in robust, real-world multi-robot perception and autonomy.
Abstract
In the past decade, although single-robot perception has made significant advancements, the exploration of multi-robot collaborative perception remains largely unexplored. This involves fusing compressed, intermittent, limited, heterogeneous, and asynchronous environmental information across multiple robots to enhance overall perception, despite challenges like sensor noise, occlusions, and sensor failures. One major hurdle has been the lack of real-world datasets. This paper presents a pioneering and comprehensive real-world multi-robot collaborative perception dataset to boost research in this area. Our dataset leverages the untapped potential of air-ground robot collaboration featuring distinct spatial viewpoints, complementary robot mobilities, coverage ranges, and sensor modalities. It features raw sensor inputs, pose estimation, and optional high-level perception annotation, thus accommodating diverse research interests. Compared to existing datasets predominantly designed for Simultaneous Localization and Mapping (SLAM), our setup ensures a diverse range and adequate overlap of sensor views to facilitate the study of multi-robot collaborative perception algorithms. We demonstrate the value of this dataset qualitatively through multiple collaborative perception tasks. We believe this work will unlock the potential research of high-level scene understanding through multi-modal collaborative perception in multi-robot settings.
