Risk-Calibrated Human-Robot Interaction via Set-Valued Intent Prediction
Justin Lidard, Hang Pham, Ariel Bachman, Bryan Boateng, Anirudha Majumdar
TL;DR
RCIP addresses risk-aware human-robot interaction under latent, multi-modal intents by coupling set-valued intent prediction with statistical risk calibration. It frames action selection as a sequence-level multi-hypothesis testing problem, providing finite-sample guarantees on cumulative loss while enabling flexible autonomy through tunable parameters $(\lambda,\theta)$ and risk budgets $(\alpha_1,\dots,\alpha_K)$. The approach supports both task-specific and zero-shot intent predictors and extends conformal prediction to sequences with a calibration stage for multi-risk objectives. Empirical results across four domains (simulation and hardware) show that RCIP preserves high task success while substantially reducing human input compared to baselines, demonstrating practical certifiable autonomy in diverse interactive settings.
Abstract
Tasks where robots must anticipate human intent, such as navigating around a cluttered home or sorting everyday items, are challenging because they exhibit a wide range of valid actions that lead to similar outcomes. Moreover, zero-shot cooperation between human-robot partners is an especially challenging problem because it requires the robot to infer and adapt on the fly to a latent human intent, which could vary significantly from human to human. Recently, deep learned motion prediction models have shown promising results in predicting human intent but are prone to being confidently incorrect. In this work, we present Risk-Calibrated Interactive Planning (RCIP), which is a framework for measuring and calibrating risk associated with uncertain action selection in human-robot cooperation, with the fundamental idea that the robot should ask for human clarification when the risk associated with the uncertainty in the human's intent cannot be controlled. RCIP builds on the theory of set-valued risk calibration to provide a finite-sample statistical guarantee on the cumulative loss incurred by the robot while minimizing the cost of human clarification in complex multi-step settings. Our main insight is to frame the risk control problem as a sequence-level multi-hypothesis testing problem, allowing efficient calibration using a low-dimensional parameter that controls a pre-trained risk-aware policy. Experiments across a variety of simulated and real-world environments demonstrate RCIP's ability to predict and adapt to a diverse set of dynamic human intents.
