Uncertainty-Aware Deployment of Pre-trained Language-Conditioned Imitation Learning Policies
Bo Wu, Bruce D. Lee, Kostas Daniilidis, Bernadette Bucher, Nikolai Matni
TL;DR
The paper addresses unreliable generalization of pre-trained language-conditioned imitation policies by introducing a calibration-enabled uncertainty-aware deployment framework. It calibrates model outputs with temperature scaling to align confidence with imitation correctness and employs uncertainty-aware action selection that aggregates neighboring action confidences within a distance threshold. The approach is evaluated on three models (PerAct, RVT, CLIPort) across RLBench and Ravens, showing improved task success, particularly when models are poorly calibrated or faced with distribution shifts like distractors. Importantly, the method does not require retraining the original policies, offering a scalable path to more reliable generalist robotic policies in diverse environments. The work highlights the situational dependence of calibration benefits and points to future work in extending calibrated uncertainty to dynamics forecasting and planning.
Abstract
Large-scale robotic policies trained on data from diverse tasks and robotic platforms hold great promise for enabling general-purpose robots; however, reliable generalization to new environment conditions remains a major challenge. Toward addressing this challenge, we propose a novel approach for uncertainty-aware deployment of pre-trained language-conditioned imitation learning agents. Specifically, we use temperature scaling to calibrate these models and exploit the calibrated model to make uncertainty-aware decisions by aggregating the local information of candidate actions. We implement our approach in simulation using three such pre-trained models, and showcase its potential to significantly enhance task completion rates. The accompanying code is accessible at the link: https://github.com/BobWu1998/uncertainty_quant_all.git
