Explainable Information Design
Yiling Chen, Tao Lin, Wei Tang, Jamie Tucker-Foltz
TL;DR
This paper introduces explainable information design for linear information design with a continuous state in $[0,1]$ and studies $K$-partitional signaling schemes that deterministically map each state-interval to a single signal. It proves a tight worst-case price of explainability of exactly $1/2$, meaning any $K$-partitional policy attains at least half the performance of an unrestricted scheme, and shows this bound is tight. On the algorithmic side, exact optimization is NP-hard, but a polynomial-time $ ext{FPTAS}$ exists for Lipschitz utility functions, and a $1/2$-approximation is achievable for piecewise-constant/Lipschitz utilities by converting an optimal unrestricted policy to a $K$-partition policy via a bi-pooling→partition conversion. The analysis relies on reformulating signaling via mean-preserving contractions, characterizing extreme points as bi-pooling MPCs, and providing a constructive conversion algorithm that preserves a constant fraction of the objective. The results offer a principled trade-off between explainability and performance, with practical implications for designing auditable and transparent information schemes in economics, recommender systems, and policy design. The work also outlines extensions to high-dimensional state spaces and discusses open questions about conditions under which PoE remains 1 and how to handle mass points and discrete states.
Abstract
The optimal signaling schemes in information design (Bayesian persuasion) problems often involve non-explainable randomization or disconnected partitions of state space, which are too intricate to be audited or communicated. We propose explainable information design in the context of information design with a continuous state space, restricting the information designer to use $K$-partitional signaling schemes defined by deterministic and monotone partitions of the state space, where a unique signal is sent for all states in each part. We first prove that the price of explainability (PoE) -- the ratio between the performances of the optimal explainable signaling scheme and unrestricted signaling scheme -- is exactly $1/2$ in the worst case, meaning that partitional signaling schemes are never worse than arbitrary signaling schemes by a factor of 2. We then study the complexity of computing optimal explainable signaling schemes. We show that the exact optimization problem is NP-hard in general. But for Lipschitz utility functions, an $\varepsilon$-approximately optimal explainable signaling scheme can be computed in polynomial time. And for piecewise constant utility functions, we provide an efficient algorithm to find an explainable signaling scheme that provides a $1/2$ approximation to the optimal unrestricted signaling scheme, which matches the worst-case PoE bound. A technical tool we develop is a conversion from any optimal signaling scheme (which satisfies a bi-pooling property) to a partitional signaling scheme that achieves $1/2$ fraction of the expected utility of the former. We use this tool in the proofs of both our PoE result and algorithmic result.
