Asynchronous Training of Mixed-Role Human Actors in a Partially-Observable Environment
Kimberlee Chestnut Chang, Reed Jensen, Rohan Paleja, Sam L. Polk, Rob Seater, Jackson Steilberg, Curran Schiefelbein, Melissa Scheldrup, Matthew Gombolay, Mabel D. Ramirez
TL;DR
The paper addresses the challenge of teaching humans to coordinate in partially observable, mixed-role teams by introducing cooperative asynchronous training with autonomous surrogate teammates in HYBS. It presents a novel experimental design that clusters teammate behaviors from demonstrations to reduce the number of training conditions, and compares human-human training with apprentice and heuristic autonomous partners, plus a no-training baseline. While results show mixed outcomes—humans outperforming agents in training, and evaluation results showing no clear transfer—the study yields concrete design guidelines for building and evaluating cooperative asynchronous training systems and robot surrogates. The work has practical implications for scalable, time-efficient training of human teams in complex, information-fragmented environments and informs future development of autonomous training partners.
Abstract
In cooperative training, humans within a team coordinate on complex tasks, building mental models of their teammates and learning to adapt to teammates' actions in real-time. To reduce the often prohibitive scheduling constraints associated with cooperative training, this article introduces a paradigm for cooperative asynchronous training of human teams in which trainees practice coordination with autonomous teammates rather than humans. We introduce a novel experimental design for evaluating autonomous teammates for use as training partners in cooperative training. We apply the design to a human-subjects experiment where humans are trained with either another human or an autonomous teammate and are evaluated with a new human subject in a new, partially observable, cooperative game developed for this study. Importantly, we employ a method to cluster teammate trajectories from demonstrations performed in the experiment to form a smaller number of training conditions. This results in a simpler experiment design that enabled us to conduct a complex cooperative training human-subjects study in a reasonable amount of time. Through a demonstration of the proposed experimental design, we provide takeaways and design recommendations for future research in the development of cooperative asynchronous training systems utilizing robot surrogates for human teammates.
