EMT: A Visual Multi-Task Benchmark Dataset for Autonomous Driving

Nadya Abdel Madjid; Murad Mebrahtu; Abdulrahman Ahmad; Abdelmoamen Nasser; Bilal Hassan; Naoufel Werghi; Jorge Dias; Majid Khonji

TL;DR

EMT is a region-specific, multi-task benchmark for autonomous driving: a unified visual dataset, collected in the UAE, that supports tracking, trajectory forecasting, and intention prediction. The authors evaluate multiple task-specific models and cross-task dependencies, and provide baseline detectors, trackers, and predictors for the three complementary benchmarks. Contributions include three aligned task datasets with extensive annotations (over 570k bounding boxes across ~20 videos), diverse driving scenarios, and evaluation protocols that enable cross-task analysis and generalization to underrepresented regions. The authors also outline Sim2Real and multimodal extensions as directions for further improving safety and reliability in Gulf-region traffic.

Abstract

This paper introduces the Emirates Multi-Task (EMT) dataset, designed to support multi-task benchmarking within a unified framework. It comprises over 30,000 frames from a dash-camera perspective and 570,000 annotated bounding boxes, covering approximately 150 kilometers of driving routes that reflect the distinctive road topology, congestion patterns, and driving behavior of Gulf region traffic. The dataset supports three primary tasks: tracking, trajectory forecasting, and intention prediction. Each benchmark is accompanied by corresponding evaluations: (1) multi-agent tracking experiments addressing multi-class scenarios and occlusion handling; (2) trajectory forecasting evaluation using deep sequential and interaction-aware models; and (3) intention prediction experiments based on observed trajectories. The dataset is publicly available at https://avlab.io/emt-dataset, with pre-processing scripts and evaluation models at https://github.com/AV-Lab/emt-dataset.
