A Unified Framework for Robots that Influence Humans over Long-Term Interaction
Shahabedin Sagheb, Sagar Parekh, Ravi Pandya, Ye-Ji Mun, Katherine Driggs-Campbell, Andrea Bajcsy, Dylan P. Losey
TL;DR
The paper addresses the challenge that robots must influence humans not only in the short term but over repeated interactions, as humans adapt their behavior. It proposes a unifying control-theoretic framework that models the human partner as history-aware with short- and long-term latent dynamics, formulating the problem as a mixed-observability Markov decision process. The authors show that existing game-theoretic and latent-representation approaches are special cases of this framework, and demonstrate through simulations and user studies that the unified approach yields more reliable long-term influence and task performance. They also provide tractable approximations (e.g., One-Step) and validate the method in aerial-drone and driving scenarios, highlighting practical benefits for safe and efficient human-robot coordination. The work advances embodied AI by offering principled, scalable means to regulate robot influence in dynamic, long-term human-robot interactions.
Abstract
Robot actions influence the decisions of nearby humans. Here influence refers to intentional change: robots influence humans when they shift the human's behavior in a way that helps the robot complete its task. Imagine an autonomous car trying to merge; by proactively nudging into the human's lane, the robot causes human drivers to yield and provide space. Influence is often necessary for seamless interaction. However, if influence is left unregulated and uncontrolled, robots will negatively impact the humans around them. Prior works have begun to address this problem by creating a variety of control algorithms that seek to influence humans. Although these methods are effective in the short-term, they fail to maintain influence over time as the human adapts to the robot's behaviors. In this paper we therefore present an optimization framework that enables robots to purposely regulate their influence over humans across both short-term and long-term interactions. Here the robot maintains its influence by reasoning over a dynamic human model which captures how the robot's current choices will impact the human's future behavior. Our resulting framework serves to unify current approaches: we demonstrate that state-of-the-art methods are simplifications of our underlying formalism. Our framework also provides a principled way to generate influential policies: in the best case the robot exactly solves our framework to find optimal, influential behavior. But when solving this optimization problem becomes impractical, designers can introduce their own simplifications to reach tractable approximations. We experimentally compare our unified framework to state-of-the-art baselines and ablations, and demonstrate across simulations and user studies that this framework is able to successfully influence humans over repeated interactions. See videos of our experiments here: https://youtu.be/nPekTUfUEbo
