Sample-Efficient Safety Assurances using Conformal Prediction
Rachel Luo, Shengjia Zhao, Jonathan Kuck, Boris Ivanovic, Silvio Savarese, Edward Schmerling, Marco Pavone
TL;DR
The paper tackles the challenge of guaranteeing safety in robotics warning systems with limited data. It adapts Mondrian conformal prediction to a robotics setting, enabling provable $\epsilon$-safe false negative rates using as few as $1/\epsilon$ unsafe examples, under a single exchangeability assumption. The approach is instantiated and validated on driver alert and robotic grasping tasks, demonstrating tight control of the false negative rate while maintaining a low false positive rate, and offering favorable sample-efficiency relative to traditional PAC learning. The work highlights the practical impact of provable safety guarantees in time-series, sequential-decision robotics, and sets the stage for future extensions to non-exchangeable data and conditional safety guarantees.
Abstract
When deploying machine learning models in high-stakes robotics applications, the ability to detect unsafe situations is crucial. Early warning systems can provide alerts when an unsafe situation is imminent (in the absence of corrective action). To reliably improve safety, these warning systems should have a provable false negative rate; i.e. of the situations that are unsafe, fewer than $ε$ will occur without an alert. In this work, we present a framework that combines a statistical inference technique known as conformal prediction with a simulator of robot/environment dynamics, in order to tune warning systems to provably achieve an $ε$ false negative rate using as few as $1/ε$ data points. We apply our framework to a driver warning system and a robotic grasping application, and empirically demonstrate guaranteed false negative rate while also observing low false detection (positive) rate.
