Influence-Based Reward Modulation for Implicit Communication in Human-Robot Interaction
Haoyang Jiang, Elizabeth A. Croft, Michael G. Burke
TL;DR
This work addresses implicit communication in human-robot interaction by proposing a model-free framework that modulates social influence with Transfer Entropy ($TE$) via reward augmentation in a partially observable Markov decision process ($POMDP$). The approach computes $TE$ from other agents’ histories and enriches the ego-agent’s reward to encourage (or resist) information transfer, learned through $Q$-learning with a softmax policy. Across simulations, virtual human-agent tests, and real human-robot experiments, boosting influence improves collaboration and, in competitive settings, alters human performance; resisting influence generally hampers collaborative outcomes. The findings demonstrate a practical, model-free mechanism for shaping implicit communication in HRI, with potential applications in cooperative and adversarial social navigation and broad implications for design and ethics in autonomous systems.
Abstract
Communication is essential for successful interaction. In human-robot interaction, implicit communication holds the potential to enhance robots' understanding of human needs, emotions, and intentions. This paper introduces a method to foster implicit communication in HRI without explicitly modelling human intentions or relying on pre-existing knowledge. Leveraging Transfer Entropy, we modulate influence between agents in social interactions in scenarios involving either collaboration or competition. By integrating influence into agents' rewards within a partially observable Markov decision process, we demonstrate that boosting influence enhances collaboration, while resisting influence diminishes performance. Our findings are validated through simulations and real-world experiments with human participants in social navigation settings.
