Table of Contents
Fetching ...

Data selection method for assessment of autonomous vehicles

Linh Trinh, Ali Anwar, Siegfried Mercelis

TL;DR

The idea is to optimize the similarity between the metadata distribution of the selected data and a predefined metadata distribution that is expected for validation, and shows that the method can perform data selection tasks efficiently.

Abstract

As the popularity of autonomous vehicles has grown, many standards and regulators, such as ISO, NHTSA, and Euro NCAP, require safety validation to ensure a sufficient level of safety before deploying them in the real world. Manufacturers gather a large amount of public road data for this purpose. However, the majority of these validation activities are done manually by humans. Furthermore, the data used to validate each driving feature may differ. As a result, it is essential to have an efficient data selection method that can be used flexibly and dynamically for verification and validation while also accelerating the validation process. In this paper, we present a data selection method that is practical, flexible, and efficient for assessment of autonomous vehicles. Our idea is to optimize the similarity between the metadata distribution of the selected data and a predefined metadata distribution that is expected for validation. Our experiments on the large dataset BDD100K show that our method can perform data selection tasks efficiently. These results demonstrate that our methods are highly reliable and can be used to select appropriate data for the validation of various safety functions.

Data selection method for assessment of autonomous vehicles

TL;DR

The idea is to optimize the similarity between the metadata distribution of the selected data and a predefined metadata distribution that is expected for validation, and shows that the method can perform data selection tasks efficiently.

Abstract

As the popularity of autonomous vehicles has grown, many standards and regulators, such as ISO, NHTSA, and Euro NCAP, require safety validation to ensure a sufficient level of safety before deploying them in the real world. Manufacturers gather a large amount of public road data for this purpose. However, the majority of these validation activities are done manually by humans. Furthermore, the data used to validate each driving feature may differ. As a result, it is essential to have an efficient data selection method that can be used flexibly and dynamically for verification and validation while also accelerating the validation process. In this paper, we present a data selection method that is practical, flexible, and efficient for assessment of autonomous vehicles. Our idea is to optimize the similarity between the metadata distribution of the selected data and a predefined metadata distribution that is expected for validation. Our experiments on the large dataset BDD100K show that our method can perform data selection tasks efficiently. These results demonstrate that our methods are highly reliable and can be used to select appropriate data for the validation of various safety functions.
Paper Structure (7 sections, 6 equations, 4 figures, 3 tables, 2 algorithms)

This paper contains 7 sections, 6 equations, 4 figures, 3 tables, 2 algorithms.

Figures (4)

  • Figure 1: Illustration of data selection workflow in our proposed framework.
  • Figure 2: Performance of our selection method on selecting data of BDD100K videos in various $\rho$ for some ADAS features.
  • Figure 3: The similarity matrix of randomly selected samples in the data selected by DC data_curation is compared against our method with or without similarity filtering.
  • Figure 4: Qualitative example of selected data on BDD100K video by our selection method with OpenStreetMap.