To Ask or Not To Ask: Human-in-the-loop Contextual Bandits with Applications in Robot-Assisted Feeding
Rohan Banerjee, Rajat Kumar Jenamani, Sidharth Vasudev, Amal Nanavati, Katherine Dimitropoulou, Sarah Dean, Tapomayukh Bhattacharjee
TL;DR
This work develops a human-in-the-loop contextual bandit framework for robot-assisted bite acquisition, introducing LinUCB-QG, which balances autonomous action with selective user querying by leveraging a data-driven workload predictor. The workload model is trained on a diverse dataset including mobility-impaired users and OTs, enabling the system to anticipate querying workload and adapt its querying strategy. In simulation and a real-world Kinova-based study with 19 participants, LinUCB-QG achieves higher task success than autonomous baselines while imposing substantially less workload than always querying, including significant gains for users with mobility limitations. The approach advances practical, user-centered robotic feeding by reducing cognitive and physical burden through nonintrusive workload modeling and adaptive querying policies with strong empirical support. Collectively, the work demonstrates the feasibility and value of workload-aware human-in-the-loop policies for complex manipulation tasks in assistive robotics.
Abstract
Robot-assisted bite acquisition involves picking up food items with varying shapes, compliance, sizes, and textures. Fully autonomous strategies may not generalize efficiently across this diversity. We propose leveraging feedback from the care recipient when encountering novel food items. However, frequent queries impose a workload on the user. We formulate human-in-the-loop bite acquisition within a contextual bandit framework and introduce LinUCB-QG, a method that selectively asks for help using a predictive model of querying workload based on query types and timings. This model is trained on data collected in an online study involving 14 participants with mobility limitations, 3 occupational therapists simulating physical limitations, and 89 participants without limitations. We demonstrate that our method better balances task performance and querying workload compared to autonomous and always-querying baselines and adjusts its querying behavior to account for higher workload in users with mobility limitations. We validate this through experiments in a simulated food dataset and a user study with 19 participants, including one with severe mobility limitations. Please check out our project website at: http://emprise.cs.cornell.edu/hilbiteacquisition/
