Reputation-based Persuasion Platforms
Itai Arieli, Omer Madmon, Moshe Tennenholtz
TL;DR
The paper studies a bi-level Bayesian persuasion problem in which a third-party platform controls the sender’s information about users, with the platform aiming to maximize average user welfare. In the one-shot setting, the problem is shown to be strategically equivalent to Bergemann et al.'s market segmentation, allowing efficient computation via their algorithm by mapping persuasion thresholds to valuations through $V=igrace{ u_{ heta}=oldsymbol{eta}+(1-oldsymbol{eta}) au_{ heta}igrace}$ and $oldsymbol{\\pi}=x^*$, with the pricing rule $oldsymbol{\\phi}(oldsymbol{\\pi})=oldsymbol{eta}+(1-oldsymbol{\\beta})p(x)$. A reputation-based extension introduces sequential arrivals and a punishment mechanism that moves the sender to a low-reputation state if truthful signaling is violated; in this setting the optimal policy can be characterized by a two-step procedure that applies Bergemann’s algorithm to the part of the posterior where truthful signaling is not enforced, after solving a finite-dimensional optimization involving $(oldsymbol{\\alpha}, x^T, x^F)$. The results reveal that, under suitable conditions, the reputation mechanism can enhance welfare while maintaining incentive compatibility, with numerical evidence suggesting the platform’s strategy may privilege users who are harder to persuade, especially when the sender is less patient. Overall, the paper contributes a novel two-level information-design framework, a tractable reduction to market segmentation for the one-shot case, and a principled characterization of optimal, reputation-based platform policies, with extensions to richer state spaces and practical implications for reputation-enabled platforms.
Abstract
In this paper, we introduce a two-stage Bayesian persuasion model in which a third-party platform controls the information available to the sender about users' preferences. We aim to characterize the optimal information disclosure policy of the platform, which maximizes average user utility, under the assumption that the sender also follows its own optimal policy. We show that this problem can be reduced to a model of market segmentation, in which probabilities are mapped into valuations. We then introduce a repeated variation of the persuasion platform problem in which myopic users arrive sequentially. In this setting, the platform controls the sender's information about users and maintains a reputation for the sender, punishing it if it fails to act truthfully on a certain subset of signals. We provide a characterization of the optimal platform policy in the reputation-based setting, which is then used to simplify the optimization problem of the platform.
