How Safe Am I Given What I See? Calibrated Prediction of Safety Chances for Image-Controlled Autonomy
Zhenjiang Mao, Carson Sobolewski, Ivan Ruchkin
TL;DR
This work tackles online safety prediction for image-controlled autonomous systems in the absence of a low-dimensional dynamical state. It introduces a configurable family of world-model–based predictors that operate on high-dimensional observations, with two predictor types (safety labels and safety chances) trained offline on controller-specific or controller-independent data. A post-hoc conformal calibration framework, combined with adaptive binning, provides statistical guarantees that the predicted safety probabilities bound the true safety rates with coverage $1-\alpha$. Empirical results on racing car and cart pole benchmarks show latent predictors and calibrated, conformally bounded safety chances yield more reliable decisions than uncalibrated or purely image-based approaches, supporting practical online safety interventions. The proposed calibration-friendly, state-agnostic approach offers scalable, interpretable reliability for high-dimensional perception in autonomous systems, with potential for broader safety assurances in vision-based control.
Abstract
End-to-end learning has emerged as a major paradigm for developing autonomous systems. Unfortunately, with its performance and convenience comes an even greater challenge of safety assurance. A key factor of this challenge is the absence of the notion of a low-dimensional and interpretable dynamical state, around which traditional assurance methods revolve. Focusing on the online safety prediction problem, this paper proposes a configurable family of learning pipelines based on generative world models, which do not require low-dimensional states. To implement these pipelines, we overcome the challenges of learning safety-informed latent representations and missing safety labels under prediction-induced distribution shift. These pipelines come with statistical calibration guarantees on their safety chance predictions based on conformal prediction. We perform an extensive evaluation of the proposed learning pipelines on two case studies of image-controlled systems: a racing car and a cartpole.
