Table of Contents
Fetching ...

ICPR 2024 Competition on Safe Segmentation of Drive Scenes in Unstructured Traffic and Adverse Weather Conditions

Furqan Ahmed Shaik, Sandeep Nagar, Aiswarya Maturi, Harshit Kumar Sankhla, Dibyendu Ghosh, Anshuman Majumdar, Srikanth Vidapanakal, Kunal Chaudhary, Sunny Manchanda, Girish Varma

Abstract

The ICPR 2024 Competition on Safe Segmentation of Drive Scenes in Unstructured Traffic and Adverse Weather Conditions served as a rigorous platform to evaluate and benchmark state-of-the-art semantic segmentation models under challenging conditions for autonomous driving. Over several months, participants were provided with the IDD-AW dataset, consisting of 5000 high-quality RGB-NIR image pairs, each annotated at the pixel level and captured under adverse weather conditions such as rain, fog, low light, and snow. A key aspect of the competition was the use and improvement of the Safe mean Intersection over Union (Safe mIoU) metric, designed to penalize unsafe incorrect predictions that could be overlooked by traditional mIoU. This innovative metric emphasized the importance of safety in developing autonomous driving systems. The competition showed significant advancements in the field, with participants demonstrating models that excelled in semantic segmentation and prioritized safety and robustness in unstructured and adverse conditions. The results of the competition set new benchmarks in the domain, highlighting the critical role of safety in deploying autonomous vehicles in real-world scenarios. The contributions from this competition are expected to drive further innovation in autonomous driving technology, addressing the critical challenges of operating in diverse and unpredictable environments.

ICPR 2024 Competition on Safe Segmentation of Drive Scenes in Unstructured Traffic and Adverse Weather Conditions

Abstract

The ICPR 2024 Competition on Safe Segmentation of Drive Scenes in Unstructured Traffic and Adverse Weather Conditions served as a rigorous platform to evaluate and benchmark state-of-the-art semantic segmentation models under challenging conditions for autonomous driving. Over several months, participants were provided with the IDD-AW dataset, consisting of 5000 high-quality RGB-NIR image pairs, each annotated at the pixel level and captured under adverse weather conditions such as rain, fog, low light, and snow. A key aspect of the competition was the use and improvement of the Safe mean Intersection over Union (Safe mIoU) metric, designed to penalize unsafe incorrect predictions that could be overlooked by traditional mIoU. This innovative metric emphasized the importance of safety in developing autonomous driving systems. The competition showed significant advancements in the field, with participants demonstrating models that excelled in semantic segmentation and prioritized safety and robustness in unstructured and adverse conditions. The results of the competition set new benchmarks in the domain, highlighting the critical role of safety in deploying autonomous vehicles in real-world scenarios. The contributions from this competition are expected to drive further innovation in autonomous driving technology, addressing the critical challenges of operating in diverse and unpredictable environments.
Paper Structure (29 sections, 7 equations, 7 figures, 3 tables)

This paper contains 29 sections, 7 equations, 7 figures, 3 tables.

Figures (7)

  • Figure 1: Examples of RGB/NIR/Annotations triplets in four adverse weather conditions from IDD-AW.
  • Figure 2: Example of tree distances between labels. Tree distance is used to assess the severity of mispredictions. It measures the distance between the predicted label and the correct label within the hierarchy. Red, orange, and yellow colors represent different levels of severity, with red indicating the most severe mispredictions and yellow indicating the least severe. The $\text{td} (\text{sidewalk},\text{motorcycle})=3$ since the length of the path is 6 and td is length/2. Similarly $\text{td} (\text{person},\text{rider})=2$ and $\text{td}(\text{truck},\text{bus})=1$.
  • Figure 3: Qualitative Comparision of predicted colourmaps between ground truth and top 3 participants. The pixels inside the red highlighted boxes show the major differences between the ground truth and all 3 approaches.
  • Figure 4: Distribution of Safe mIoU (%) vs mIoU (%) for top 3 participants and the IDD-AW paper baseline on IDD-AW test set. y-axis represents the number of images with a particular value of SmIoU/mIoU.
  • Figure 5: Fusion net architecture.
  • ...and 2 more figures