Mind the Gap! Choice Independence in Using Multilingual LLMs for Persuasive Co-Writing Tasks in Different Languages
Shreyan Biswas, Alexander Erlei, Ujwal Gadiraju
TL;DR
The paper investigates how multilingual LLMs with language-dependent performance affect user reliance and the persuasiveness of charity advertisements produced through AI-assisted co-writing. It uses two pre-registered experiments in English and Spanish, employing an ABScribe-based workflow to measure how exposure in one language biases usage in the other and to assess downstream donation outcomes and donor beliefs. The results show a clear violation of the independence of choice: prior exposure to Spanish LLMs reduces subsequent English LLM use, while donations remain largely unaffected by ad type, though beliefs about AI involvement significantly reduce donations, especially among Spanish-speaking women. These findings have practical implications for deploying multilingual AI assistants, highlighting potential second-order effects on uptake, equity, and donor behavior, and they emphasize the importance of transparent design and user education to mitigate biased generalizations across languages.
Abstract
Recent advances in generative AI have precipitated a proliferation of novel writing assistants. These systems typically rely on multilingual large language models (LLMs), providing globalized workers the ability to revise or create diverse forms of content in different languages. However, there is substantial evidence indicating that the performance of multilingual LLMs varies between languages. Users who employ writing assistance for multiple languages are therefore susceptible to disparate output quality. Importantly, recent research has shown that people tend to generalize algorithmic errors across independent tasks, violating the behavioral axiom of choice independence. In this paper, we analyze whether user utilization of novel writing assistants in a charity advertisement writing task is affected by the AI's performance in a second language. Furthermore, we quantify the extent to which these patterns translate into the persuasiveness of generated charity advertisements, as well as the role of peoples' beliefs about LLM utilization in their donation choices. Our results provide evidence that writers who engage with an LLM-based writing assistant violate choice independence, as prior exposure to a Spanish LLM reduces subsequent utilization of an English LLM. While these patterns do not affect the aggregate persuasiveness of the generated advertisements, people's beliefs about the source of an advertisement (human versus AI) do. In particular, Spanish-speaking female participants who believed that they read an AI-generated advertisement strongly adjusted their donation behavior downwards. Furthermore, people are generally not able to adequately differentiate between human-generated and LLM-generated ads. Our work has important implications for the design, development, integration, and adoption of multilingual LLMs as assistive agents -- particularly in writing tasks.
