Robot Behavior Personalization from Sparse User Feedback
Maithili Patel, Sonia Chernova
TL;DR
The paper tackles personalizing robot assistance for an open, unbounded set of household tasks using sparse user feedback. It introduces TAACo, a two-module framework comprising a Commonsense Module that maps task components to abstract concepts and a Personalization Module that predicts the preferred adaptation $\phi$ and generates explanations, with $\tilde{t}=\{(x,\theta^x,m)\}$ serving as the intermediate representation. TAACo demonstrates strong empirical performance, achieving $\approx 0.71$ prediction accuracy with 40 user-feedback samples (oracle $=0.89$) and explaining its decisions effectively, outperforming GPT-4 and a rule-based baseline, while enabling per-user local training and explainability. The approach is validated on real-user data from five older adults and demonstrated on a Stretch robot, highlighting its potential to deliver personalized, interpretable, and data-efficient home-robot assistance. Overall, TAACo advances open-set personalisation in HRI by leveraging abstract concepts to bridge user preferences with robot policies and by producing faithful explanations that align with user reasoning.
Abstract
As service robots become more general-purpose, they will need to adapt to their users' preferences over a large set of all possible tasks that they can perform. This includes preferences regarding which actions the users prefer to delegate to robots as opposed to doing themselves. Existing personalization approaches require task-specific data for each user. To handle diversity across all household tasks and users, and nuances in user preferences across tasks, we propose to learn a task adaptation function independently, which can be used in tandem with any universal robot policy to customize robot behavior. We create Task Adaptation using Abstract Concepts (TAACo) framework. TAACo can learn to predict the user's preferred manner of assistance with any given task, by mediating reasoning through a representation composed of abstract concepts built based on user feedback. TAACo can generalize to an open set of household tasks from small amount of user feedback and explain its inferences through intuitive concepts. We evaluate our model on a dataset we collected of 5 people's preferences, and show that TAACo outperforms GPT-4 by 16% and a rule-based system by 54%, on prediction accuracy, with 40 samples of user feedback.
