MITO: A Millimeter-Wave Dataset and Simulator for Non-Line-of-Sight Perception
Laura Dodds, Tara Boroushaki, Cusuh Ham, Fadel Adib
TL;DR
MITO addresses the core problem that optical perception fails under occlusion by introducing a large mmWave LOS/NLOS dataset and an open-source SAR-based simulator. The approach combines real captures with two mmWave frequencies and an RGB-D sensor, plus a simulator that renders synthetic mmWave images from 3D meshes to support training and benchmarking. The key contributions are over 24 million real mmWave frames producing 550 high-resolution SAR images across 76 YCB objects, ground-truth segmentation, and a multi-reflection simulator that merges specular and edge reflections; benchmarks for NLOS segmentation and Sim2Real shape classification are demonstrated. The work significantly advances mmWave perception for robotics and logistics by providing a scalable, multimodal dataset, realistic simulation tools, and practical baselines for NLOS tasks.
Abstract
The ability to observe the world is fundamental to reasoning and making informed decisions on how to interact with the environment. However, optical perception can often be disrupted due to common occurrences, such as occlusions, which can pose challenges to existing vision systems. We present MITO, the first millimeter-wave (mmWave) dataset of diverse, everyday objects, collected using a UR5 robotic arm with two mmWave radars operating at different frequencies and an RGB-D camera. Unlike visible light, mmWave signals can penetrate common occlusions (e.g., cardboard boxes, fabric, plastic) but each mmWave frame has much lower resolution than typical cameras. To capture higher-resolution mmWave images, we leverage the robot's mobility and fuse frames over the synthesized aperture. MITO captures over 24 million mmWave frames and uses them to generate 550 high-resolution mmWave (synthetic aperture) images in line-of-sight and non-light-of-sight (NLOS), as well as RGB-D images, segmentation masks, and raw mmWave signals, taken from 76 different objects. We develop an open-source simulation tool that can be used to generate synthetic mmWave images for any 3D triangle mesh. Finally, we demonstrate the utility of our dataset and simulator for enabling broader NLOS perception by developing benchmarks for NLOS segmentation and classification.
