I've Changed My Mind: Robots Adapting to Changing Human Goals during Collaboration
Debasmita Ghose, Oz Gitelson, Ryan Jin, Grace Abawe, Marynel Vazquez, Brian Scassellati
TL;DR
The paper tackles the problem of human goals changing mid-task in human-robot collaboration by explicitly modeling goal changes through tracking multiple candidate human action sequences against a policy bank $\mathcal{P}$ and a goal bank $\mathcal{G}$. It introduces Differentiating Actions and Receding Horizon Planning (RHP) to actively influence the human to reveal updated goals while assisting toward the new objective, and it maintains multiple plausible histories to detect switches promptly. The approach is evaluated in a collaborative cooking domain with up to 30 recipes, showing faster and more reliable goal identification and improved collaboration efficiency compared with baseline methods like Recursive Bayesian, CDP, and Information Gain Maximization. The results from both simulation and a physical robot demonstrate practical benefits for robust, real-time goal inference and adaptive assistance in dynamic, real-world tasks.
Abstract
For effective human-robot collaboration, a robot must align its actions with human goals, even as they change mid-task. Prior approaches often assume fixed goals, reducing goal prediction to a one-time inference. However, in real-world scenarios, humans frequently shift goals, making it challenging for robots to adapt without explicit communication. We propose a method for detecting goal changes by tracking multiple candidate action sequences and verifying their plausibility against a policy bank. Upon detecting a change, the robot refines its belief in relevant past actions and constructs Receding Horizon Planning (RHP) trees to actively select actions that assist the human while encouraging Differentiating Actions to reveal their updated goal. We evaluate our approach in a collaborative cooking environment with up to 30 unique recipes and compare it to three comparable human goal prediction algorithms. Our method outperforms all baselines, quickly converging to the correct goal after a switch, reducing task completion time, and improving collaboration efficiency.
