Table of Contents
Fetching ...

Learning-Based Robust Bayesian Persuasion with Conformal Prediction Guarantees

Heeseung Bang, Andreas A. Malikopoulos

TL;DR

This work addresses robust Bayesian persuasion when receivers form beliefs via private or non-Bayesian processes by marrying neural networks with conformal prediction to produce finite-sample, distribution-free guarantees. The core idea is to learn an end-to-end mapping from receiver observations, sender signals, and signaling policies to action distributions, while enclosing possible receiver actions in conformal prediction sets $C_{1-\alpha}(Y,S,\pi)$. It provides exact coverage guarantees for data-generating policies and derives bounds on coverage degradation under policy shifts, along with neural-network approximation and estimation error bounds and a finite-sample lower bound on sender utility. Numerical experiments in smart-grid energy management demonstrate robustness to private information and behavioral heterogeneity, with practical guidance for policy transfer and calibration in real systems.

Abstract

Classical Bayesian persuasion assumes that senders fully understand how receivers form beliefs and make decisions--an assumption that rarely holds when receivers possess private information or exhibit non-Bayesian behavior. In this paper, we develop a learning-based framework that integrates neural networks with conformal prediction to achieve robust persuasion under uncertainty about receiver belief formation. The proposed neural architecture learns end-to-end mappings from receiver observations and sender signals to action predictions, eliminating the need to identify belief mechanisms explicitly. Conformal prediction constructs finite-sample valid prediction sets with provable marginal coverage, enabling principled, distribution-free robust optimization. We establish exact coverage guarantees for the data-generating policy and derive bounds on coverage degradation under policy shifts. Furthermore, we provide neural network approximation and estimation error bounds, with sample complexity $O(d \log(|\mathcal{U}||\mathcal{Y}||\mathcal{S}|)/\varepsilon^2)$, where $d$ denotes the effective network dimension, and finite-sample lower bounds on the sender's expected utility. Numerical experiments on smart-grid energy management illustrate the framework's robustness.

Learning-Based Robust Bayesian Persuasion with Conformal Prediction Guarantees

TL;DR

This work addresses robust Bayesian persuasion when receivers form beliefs via private or non-Bayesian processes by marrying neural networks with conformal prediction to produce finite-sample, distribution-free guarantees. The core idea is to learn an end-to-end mapping from receiver observations, sender signals, and signaling policies to action distributions, while enclosing possible receiver actions in conformal prediction sets . It provides exact coverage guarantees for data-generating policies and derives bounds on coverage degradation under policy shifts, along with neural-network approximation and estimation error bounds and a finite-sample lower bound on sender utility. Numerical experiments in smart-grid energy management demonstrate robustness to private information and behavioral heterogeneity, with practical guidance for policy transfer and calibration in real systems.

Abstract

Classical Bayesian persuasion assumes that senders fully understand how receivers form beliefs and make decisions--an assumption that rarely holds when receivers possess private information or exhibit non-Bayesian behavior. In this paper, we develop a learning-based framework that integrates neural networks with conformal prediction to achieve robust persuasion under uncertainty about receiver belief formation. The proposed neural architecture learns end-to-end mappings from receiver observations and sender signals to action predictions, eliminating the need to identify belief mechanisms explicitly. Conformal prediction constructs finite-sample valid prediction sets with provable marginal coverage, enabling principled, distribution-free robust optimization. We establish exact coverage guarantees for the data-generating policy and derive bounds on coverage degradation under policy shifts. Furthermore, we provide neural network approximation and estimation error bounds, with sample complexity , where denotes the effective network dimension, and finite-sample lower bounds on the sender's expected utility. Numerical experiments on smart-grid energy management illustrate the framework's robustness.

Paper Structure

This paper contains 16 sections, 5 theorems, 45 equations, 2 figures, 1 table.

Key Result

Theorem 1

Let $\mathcal{D} = \{(x_i, y_i, s_i, \pi_i, u_i)\}_{i=1}^N$ denote the training data, and let $(x_{N+1}, y_{N+1}, s_{N+1}, \pi_{N+1}, u_{N+1})$ be a new test point. Suppose the neural network $f_{\hat{\theta}}$ is trained on $\mathcal{D}$, and define the conformal prediction set where $q_{1-\alpha} = \text{Quantile}_{1-\alpha}(\{e_{\hat{\theta}}(y_i, s_i, \pi_i, u_i)\}_{i=1}^N)$ is the $(1-\alpha

Figures (2)

  • Figure 1: Sender utility across 20 simulations for four methods: (left) frequency distribution showing the spread of utilities; (right) box plot of utility distributions with median, quartiles, and outliers.
  • Figure 2: Empirical coverage under policy shift.

Theorems & Definitions (11)

  • Theorem 1: Conformal Coverage Guarantee
  • proof
  • Theorem 2: Coverage under Policy Shift
  • proof
  • Theorem 3: Neural Network Approximation and Estimation
  • proof
  • Theorem 4: Sample Complexity for Policy Optimization
  • proof
  • Theorem 5: Robust Utility Lower Bound
  • proof
  • ...and 1 more