RDD4D: 4D Attention-Guided Road Damage Detection And Classification
Asma Alkalbani, Muhammad Saqib, Ahmed Salim Alrawahi, Abbas Anwar, Chandarnath Adak, Saeed Anwar
TL;DR
This work tackles the lack of diverse, multi-type road-damage benchmarks by introducing the Diverse Road Damage Dataset (DRDD) and a 4D attention-enhanced detector, RDD4D. RDD4D extends a RTMDet-based backbone with Attention4D blocks to refine features across scales, enabling improved detection of large and dense road damages. The approach achieves state-of-the-art performance on DRDD (AP up to 0.458 for large damages; overall AP 0.445) and strong results on CrackTinyNet (MAP@.5 ≈ 0.825) with high recall. The combination of a challenging dataset, dynamic soft-label assignment in training, and multi-scale attention yields notable practical impact for automated road maintenance and scalable infrastructure monitoring.
Abstract
Road damage detection and assessment are crucial components of infrastructure maintenance. However, current methods often struggle with detecting multiple types of road damage in a single image, particularly at varying scales. This is due to the lack of road datasets with various damage types having varying scales. To overcome this deficiency, first, we present a novel dataset called Diverse Road Damage Dataset (DRDD) for road damage detection that captures the diverse road damage types in individual images, addressing a crucial gap in existing datasets. Then, we provide our model, RDD4D, that exploits Attention4D blocks, enabling better feature refinement across multiple scales. The Attention4D module processes feature maps through an attention mechanism combining positional encoding and "Talking Head" components to capture local and global contextual information. In our comprehensive experimental analysis comparing various state-of-the-art models on our proposed, our enhanced model demonstrated superior performance in detecting large-sized road cracks with an Average Precision (AP) of 0.458 and maintained competitive performance with an overall AP of 0.445. Moreover, we also provide results on the CrackTinyNet dataset; our model achieved around a 0.21 increase in performance. The code, model weights, dataset, and our results are available on \href{https://github.com/msaqib17/Road_Damage_Detection}{https://github.com/msaqib17/Road\_Damage\_Detection}.
