Comprehensive Assessment of LiDAR Evaluation Metrics: A Comparative Study Using Simulated and Real Data

Syed Mostaquim Ali; Taufiq Rahman; Ghazal Farhani; Mohamed H. Zaki; Benoit Anctil; Dominique Charlebois

Comprehensive Assessment of LiDAR Evaluation Metrics: A Comparative Study Using Simulated and Real Data

Syed Mostaquim Ali, Taufiq Rahman, Ghazal Farhani, Mohamed H. Zaki, Benoit Anctil, Dominique Charlebois

TL;DR

This work tackles the challenge of validating virtual testing environments for autonomous driving by developing and evaluating geometric LiDAR similarity metrics. It identifies Density Aware Chamfer Distance (DCD) as the most robust metric across noise, density, distortion, and sensor settings, and demonstrates a practical two-step approach (DCD with $\alpha=1$ followed by Chamfer Distance) to assess LiDAR realism. The authors build a CARLA-based digital twin from real-world data, generate simulated LiDAR scans under varying ODDs, and show that intensity correction significantly improves perception-model alignment, achieving a mean IoU increase from 4.7% to 21.2%. The study highlights the importance of geometry-driven benchmarks for VTE credibility while acknowledging limitations in reflectivity/intensity modeling and vegetation representation, and points to future directions in richer mesh generation and multimodal validation.

Abstract

For developing safe Autonomous Driving Systems (ADS), rigorous testing is required before they are deemed safe for road deployments. Since comprehensive conventional physical testing is impractical due to cost and safety concerns, Virtual Testing Environments (VTE) can be adopted as an alternative. Comparing VTE-generated sensor outputs against their real-world analogues can be a strong indication that the VTE accurately represents reality. Correspondingly, this work explores a comprehensive experimental approach to finding evaluation metrics suitable for comparing real-world and simulated LiDAR scans. The metrics were tested in terms of sensitivity and accuracy with different noise, density, distortion, sensor orientation, and channel settings. From comparing the metrics, we found that Density Aware Chamfer Distance (DCD) works best across all cases. In the second step of the research, a Virtual Testing Environment was generated using real LiDAR scan data. The data was collected in a controlled environment with only static objects using an instrumented vehicle equipped with LiDAR, IMU and cameras. Simulated LiDAR scans were generated from the VTEs using the same pose as real LiDAR scans. The simulated and LiDAR scans were compared in terms of model perception and geometric similarity. Actual and simulated LiDAR scans have a similar semantic segmentation output with a mIoU of 21\% with corrected intensity and an average density aware chamfer distance (DCD) of 0.63. This indicates a slight difference in the geometric properties of simulated and real LiDAR scans and a significant difference between model outputs. During the comparison, density-aware chamfer distance was found to be the most correlated among the metrics with perception methods.

Comprehensive Assessment of LiDAR Evaluation Metrics: A Comparative Study Using Simulated and Real Data

TL;DR

Abstract

Comprehensive Assessment of LiDAR Evaluation Metrics: A Comparative Study Using Simulated and Real Data

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (23)