ARMCHAIR: integrated inverse reinforcement learning and model predictive control for human-robot collaboration
Angelo Caregnato-Neto, Luciano Cavalcante Siebert, Arkady Zgonnikov, Marcos Ricardo Omena de Albuquerque Maximo, Rubens Junqueira Magalhães Afonso
TL;DR
The paper tackles the challenge of integrating predictive models of human behavior with multi-robot motion planning for collaborative exploration. It introduces ARMCHAIR, which combines adversarial inverse reinforcement learning to learn a human prediction model with a receding-horizon MPC-MIP planner that jointly optimizes trajectories and task allocation while enforcing network connectivity. The approach includes a robustification via safety regions to handle prediction uncertainty and is validated through extensive Monte Carlo simulations in two environments, showing improved safety (fewer collisions and disconnections) and task performance compared to baselines. While promising, the authors note limitations such as lack of formal recursive feasibility proofs and centralized scalability, and propose future work on distributed planning and real-human experiments to further validate the framework.
Abstract
One of the key issues in human-robot collaboration is the development of computational models that allow robots to predict and adapt to human behavior. Much progress has been achieved in developing such models, as well as control techniques that address the autonomy problems of motion planning and decision-making in robotics. However, the integration of computational models of human behavior with such control techniques still poses a major challenge, resulting in a bottleneck for efficient collaborative human-robot teams. In this context, we present a novel architecture for human-robot collaboration: Adaptive Robot Motion for Collaboration with Humans using Adversarial Inverse Reinforcement learning (ARMCHAIR). Our solution leverages adversarial inverse reinforcement learning and model predictive control to compute optimal trajectories and decisions for a mobile multi-robot system that collaborates with a human in an exploration task. During the mission, ARMCHAIR operates without human intervention, autonomously identifying the necessity to support and acting accordingly. Our approach also explicitly addresses the network connectivity requirement of the human-robot team. Extensive simulation-based evaluations demonstrate that ARMCHAIR allows a group of robots to safely support a simulated human in an exploration scenario, preventing collisions and network disconnections, and improving the overall performance of the task.
