Talk Less, Interact Better: Evaluating In-context Conversational Adaptation in Multimodal LLMs
Yilun Hua, Yoav Artzi
TL;DR
This work addresses whether multimodal LLMs spontaneously develop ad-hoc conventions for efficient communication in in-context conversations. It introduces ICCA, an automated framework that leverages human-human reference-game data to quantify in-context adaptation in MLLMs, measuring utterance length, lexical convergence, and accuracy, with $WNR$ as a sensitive lexical-change metric. Across five state-of-the-art MLLMs, results show a lack of spontaneous convention formation; only heavy prompting can induce some lexical efficiency in a subset of models, and stability/convergence remain poor. The findings reveal a gap between human conversational adaptation and current training/instruction-tuning, and ICCA offers a scalable, automated platform for ongoing evaluation and future improvements in in-context adaptation for multimodal models.
Abstract
Humans spontaneously use increasingly efficient language as interactions progress, by adapting and forming ad-hoc conventions. This phenomenon has been studied extensively using reference games, showing properties of human language that go beyond relaying intents. It remains unexplored whether multimodal large language models (MLLMs) similarly increase communication efficiency during interactions, and what mechanisms they may adopt for this purpose. We introduce ICCA, an automated framework to evaluate such conversational adaptation as an in-context behavior in MLLMs. We evaluate several state-of-the-art MLLMs, and observe that while they may understand the increasingly efficient language of their interlocutor, they do not spontaneously make their own language more efficient over time. This latter ability can only be elicited in some models (e.g., GPT-4) with heavy-handed prompting. This shows that this property of linguistic interaction does not arise from current training regimes, even though it is a common hallmark of human language. ICCA is available at https://github.com/lil-lab/ICCA.
