MineInsight: A Multi-sensor Dataset for Humanitarian Demining Robotics in Off-Road Environments
Mario Malizia, Charles Hamesse, Ken Hasselmann, Geert De Cubber, Nikolaos Tsiogkas, Eric Demeester, Rob Haelterman
TL;DR
MineInsight tackles the scarcity of realistic, multi-modal datasets for off-road humanitarian demining by providing a public, multi-sensor, multi-spectral dataset with dual UGV and robotic-arm viewpoints, dual LiDARs, and data from RGB/monochrome, VIS-SWIR, and LWIR channels. It covers three vegetation-rich tracks with 35 targets (15 landmines, 20 distractors) captured over about one hour, incorporating automated bounding boxes refined by humans and minute climatology data. A key novelty is the targetless camera–LiDAR calibration and dual-view data fusion that mitigates occlusions, enabling robust evaluation of detection algorithms in cluttered environments. The dataset serves as a benchmark for domain adaptation and multi-modal fusion in realistic demining contexts while acknowledging seasonal and environmental domain gaps and annotation limitations.
Abstract
The use of robotics in humanitarian demining increasingly involves computer vision techniques to improve landmine detection capabilities. However, in the absence of diverse and realistic datasets, the reliable validation of algorithms remains a challenge for the research community. In this paper, we introduce MineInsight, a publicly available multi-sensor, multi-spectral dataset designed for off-road landmine detection. The dataset features 35 different targets (15 landmines and 20 commonly found objects) distributed along three distinct tracks, providing a diverse and realistic testing environment. MineInsight is, to the best of our knowledge, the first dataset to integrate dual-view sensor scans from both an Unmanned Ground Vehicle and its robotic arm, offering multiple viewpoints to mitigate occlusions and improve spatial awareness. It features two LiDARs, as well as images captured at diverse spectral ranges, including visible (RGB, monochrome), visible short-wave infrared (VIS-SWIR), and long-wave infrared (LWIR). Additionally, the dataset provides bounding boxes generated by an automated pipeline and refined with human supervision. We recorded approximately one hour of data in both daylight and nighttime conditions, resulting in around 38,000 RGB frames, 53,000 VIS-SWIR frames, and 108,000 LWIR frames. MineInsight serves as a benchmark for developing and evaluating landmine detection algorithms. Our dataset is available at https://github.com/mariomlz99/MineInsight.
