Instance-Level Safety-Aware Fidelity of Synthetic Data and Its Calibration

Chih-Hong Cheng; Paul Stöckel; Xingyu Zhao

Instance-Level Safety-Aware Fidelity of Synthetic Data and Its Calibration

Chih-Hong Cheng, Paul Stöckel, Xingyu Zhao

TL;DR

An optimization method is suggested to refine the synthetic data generator, reducing fidelity gaps identified by deep learning components, and experiments show this tuning enhances the correlation between safety-critical errors in synthetic and real data.

Abstract

Modeling and calibrating the fidelity of synthetic data is paramount in shaping the future of safe and reliable self-driving technology by offering a cost-effective and scalable alternative to real-world data collection. We focus on its role in safety-critical applications, introducing four types of instance-level fidelity that go beyond mere visual input characteristics. The aim is to ensure that applying testing on synthetic data can reveal real-world safety issues, and the absence of safety-critical issues when testing under synthetic data can provide a strong safety guarantee in real-world behavior. We suggest an optimization method to refine the synthetic data generator, reducing fidelity gaps identified by deep learning components. Experiments show this tuning enhances the correlation between safety-critical errors in synthetic and real data.

Instance-Level Safety-Aware Fidelity of Synthetic Data and Its Calibration

TL;DR

Abstract

Paper Structure (20 sections, 1 theorem, 11 equations, 4 figures, 1 table)

This paper contains 20 sections, 1 theorem, 11 equations, 4 figures, 1 table.

Introduction
Related Work
Foundations
The Facets of Instance-level Fidelity
Input, output, and latent feature fidelity
Safety-aware fidelity
Calibrating Data Generation for Safety-aware Fidelity
Evaluation
Research questions
RQ1 (assessment): Can the proposed SA-fidelity metric effectively differentiate and rank different synthetic data generators in terms of safety?
RQ2 (calibration): For a given synthetic data generator, can SA-fidelity be utilized for calibrating optimal configurations geared toward safety?
RQ3 (integration with scenario-based testing): When integrating SA-fidelity concepts into the engineering process of scenario-based virtual testing, what are additional challenges to be considered?
Experiment setup
Synthetic data generation
DNNs under testing
...and 5 more sections

Key Result

Lemma 1

Given a synthetic data point $x^s = g_{\theta}(sd)$, assume the perception module $f$ is locally robust with a maximum robustness bound $\epsilon_{r}$, i.e., then if a synthetic data point $x^s$ is IV-fidelitous under $\langle \mathcal{D}_{in},\epsilon_{in} \rangle$ where $\epsilon_{in} \leq \epsilon_{r}$, $x^s$ is also OV-fidelitous under $\langle \mathcal{D}_{out},\epsilon_{out} \rangle$.

Figures (4)

Figure 1: The spectrum of instance-level fidelity from the perspectives of DNN inputs, features extracted, DNN outputs and its effect on system safety.
Figure 2: Example of a real-world KITTI image (a), and its synthetic images generated via "style transfer" (b-d) and "scenario understanding" (e).
Figure 3: Examples of inconsistent object detection on safety-relevant objects in real vs. synthetic images.
Figure 4: Statistics of inconsistent predictions, over 3 DNNs and 3 synthetic datasets.

Theorems & Definitions (16)

Remark 1: Approximation of $\emph{{\smallRW}}(sd)$
Definition 1: IV-fidelity
Definition 2: OV-fidelity
Remark 2: Conservative bound on distance-based fidelity
Lemma 1: IV-fidelity, robustness, and OV-fidelity
proof
Remark 3: Photo-realistic is not real
Definition 3: LF-fidelity
Example 1
Example 1: continued
...and 6 more

Instance-Level Safety-Aware Fidelity of Synthetic Data and Its Calibration

TL;DR

Abstract

Instance-Level Safety-Aware Fidelity of Synthetic Data and Its Calibration

Authors

TL;DR

Abstract

Table of Contents

Key Result

Figures (4)

Theorems & Definitions (16)