Towards Uncertainty Unification: A Case Study for Preference Learning
Shaoting Peng, Haonan Chen, Katherine Driggs-Campbell
TL;DR
This work addresses the challenge of learning human preferences in the presence of both human and robot uncertainties. It introduces uncertainty-unified preference learning (UUPL), which jointly models discrete human uncertainty levels and integrates them into GP-based preference learning via a Laplace-approximate posterior mean and an uncertainty-weighted GMM for predictive variance. A user-specific calibration procedure aligns uncertainty representations across users, enabling consistent performance. Across simulations and real-user studies, UUPL achieves higher prediction accuracy, faster convergence, and more stable results than baselines, demonstrating the value of explicit uncertainty unification for safer and more interpretable human–robot interactions.
Abstract
Learning human preferences is essential for human-robot interaction, as it enables robots to adapt their behaviors to align with human expectations and goals. However, the inherent uncertainties in both human behavior and robotic systems make preference learning a challenging task. While probabilistic robotics algorithms offer uncertainty quantification, the integration of human preference uncertainty remains underexplored. To bridge this gap, we introduce uncertainty unification and propose a novel framework, uncertainty-unified preference learning (UUPL), which enhances Gaussian Process (GP)-based preference learning by unifying human and robot uncertainties. Specifically, UUPL includes a human preference uncertainty model that improves GP posterior mean estimation, and an uncertainty-weighted Gaussian Mixture Model (GMM) that enhances GP predictive variance accuracy. Additionally, we design a user-specific calibration process to align uncertainty representations across users, ensuring consistency and reliability in the model performance. Comprehensive experiments and user studies demonstrate that UUPL achieves state-of-the-art performance in both prediction accuracy and user rating. An ablation study further validates the effectiveness of human uncertainty model and uncertainty-weighted GMM of UUPL.
