LIMIT: Learning Interfaces to Maximize Information Transfer
Benjamin A. Christie, Dylan P. Losey
TL;DR
LIMIT tackles the problem of learning interpretable, personalized interfaces that convey hidden information from robots to humans by maximizing the conditional mutual information $I(a; \\theta \\mid s)$. It introduces a three-network online framework—$\\mathcal{H}_{\\phi}$, $\\mathcal{R}_{\\psi}$, and $\\Delta_{\\sigma}$—to maximize a tractable proxy of information gain through losses $\\mathcal{L}_{conv}$ and $\\mathcal{L}_{dist}$, while accounting for human co-adaptation with a bias toward recent data. The approach is modality-agnostic and demonstrated across simulated 1D/2D tasks, online user studies, and in-person experiments with auditory, visual, and haptic feedback, showing improved task performance and user-perceived interpretability compared to baselines like Naive and Bayes. By learning from interactions rather than relying on task models or pre-defined mappings, LIMIT enables personalized, co-adaptive interfaces that can transfer information more efficiently in diverse human-robot contexts, with potential applications in sensory substitution and BCIs. The work advances the design of intuitive, information-driven feedback mechanisms for complex human-in-the-loop systems, while outlining future work to fuse task-specific rewards with the information-transfer objective to ensure necessary, meaningful signaling.
Abstract
Robots can use auditory, visual, or haptic interfaces to convey information to human users. The way these interfaces select signals is typically pre-defined by the designer: for instance, a haptic wristband might vibrate when the robot is moving and squeeze when the robot stops. But different people interpret the same signals in different ways, so that what makes sense to one person might be confusing or unintuitive to another. In this paper we introduce a unified algorithmic formalism for learning co-adaptive interfaces from scratch. Our method does not need to know the human's task (i.e., what the human is using these signals for). Instead, our insight is that interpretable interfaces should select signals that maximize correlation between the human's actions and the information the interface is trying to convey. Applying this insight we develop LIMIT: Learning Interfaces to Maximize Information Transfer. LIMIT optimizes a tractable, real-time proxy of information gain in continuous spaces. The first time a person works with our system the signals may appear random; but over repeated interactions the interface learns a one-to-one mapping between displayed signals and human responses. Our resulting approach is both personalized to the current user and not tied to any specific interface modality. We compare LIMIT to state-of-the-art baselines across controlled simulations, an online survey, and an in-person user study with auditory, visual, and haptic interfaces. Overall, our results suggest that LIMIT learns interfaces that enable users to complete the task more quickly and efficiently, and users subjectively prefer LIMIT to the alternatives. See videos here: https://youtu.be/IvQ3TM1_2fA
