Uncertainty-Aware Measurement of Scenario Suite Representativeness for Autonomous Systems
Robab Aghazadeh Chakherlou, Siddartha Khastgir, Xingyu Zhao, Jerein Jeyachandran, Shufeng Chen
TL;DR
This work addresses how to quantify representativeness of scenario data for autonomous systems when the Target Operational Domain $TOD$ is unknown and must be inferred from data. It introduces an imprecise Bayesian framework that models the TOD as an uncertain categorical distribution and propagates prior imprecision through Dirichlet posteriors to yield interval estimates of representativeness. Representativeness is quantified via two distributional distances, $D_{TV}$ and $D_{JS}$, which become interval-valued under epistemic uncertainty, providing transparent bounds on alignment between the scenario suite and TOD. The numerical example demonstrates strong global representativeness despite deliberate oversampling of safety-critical conditions and shows how prior strength and dependencies influence the uncertainty bounds, supporting auditable safety arguments and potential Integration into Assurance 2.0.
Abstract
Assuring the trustworthiness and safety of AI systems, e.g., autonomous vehicles (AV), depends critically on the data-related safety properties, e.g., representativeness, completeness, etc., of the datasets used for their training and testing. Among these properties, this paper focuses on representativeness-the extent to which the scenario-based data used for training and testing, reflect the operational conditions that the system is designed to operate safely in, i.e., Operational Design Domain (ODD) or expected to encounter, i.e., Target Operational Domain (TOD). We propose a probabilistic method that quantifies representativeness by comparing the statistical distribution of features encoded by the scenario suites with the corresponding distribution of features representing the TOD, acknowledging that the true TOD distribution is unknown, as it can only be inferred from limited data. We apply an imprecise Bayesian method to handle limited data and uncertain priors. The imprecise Bayesian formulation produces interval-valued, uncertainty-aware estimates of representativeness, rather than a single value. We present a numerical example comparing the distributions of the scenario suite and the inferred TOD across operational categories-weather, road type, time of day, etc., under dependencies and prior uncertainty. We estimate representativeness locally (between categories) and globally as an interval.
