Provably Efficient Algorithm for Best Scoring Rule Identification in Online Principal-Agent Information Acquisition

Zichen Wang; Chuanhao Li; Huazheng Wang

Provably Efficient Algorithm for Best Scoring Rule Identification in Online Principal-Agent Information Acquisition

Zichen Wang, Chuanhao Li, Huazheng Wang

TL;DR

This work studies how to identify the optimal scoring (persuasive signaling) rule in an online principal-agent information acquisition setting. It introduces two adaptive algorithms, OLTPFC (fixed confidence) and OLTPFB (fixed budget), and proves that OLTPFC can recover an $(\epsilon, \delta)$-optimal signaling rule with an instance-dependent or instance-independent sample complexity. The analysis shows OLTPFB matches the instance-independent performance of OLTPFC, and both methods share comparable complexity across fixed-confidence and fixed-budget regimes. The results provide concrete, provable guarantees for efficiently eliciting the desired signaling rule in dynamic information-acquisition environments.

Abstract

We investigate the problem of identifying the optimal scoring rule within the principal-agent framework for online information acquisition problem. We focus on the principal's perspective, seeking to determine the desired scoring rule through interactions with the agent. To address this challenge, we propose two algorithms: OIAFC and OIAFB, tailored for fixed confidence and fixed budget settings, respectively. Our theoretical analysis demonstrates that OIAFC can extract the desired $(ε, δ)$-scoring rule with a efficient instance-dependent sample complexity or an instance-independent sample complexity. Our analysis also shows that OIAFB matches the instance-independent performance bound of OIAFC, while both algorithms share the same complexity across fixed confidence and fixed budget settings.

Provably Efficient Algorithm for Best Scoring Rule Identification in Online Principal-Agent Information Acquisition

TL;DR

-optimal signaling rule with an instance-dependent or instance-independent sample complexity. The analysis shows OLTPFB matches the instance-independent performance of OLTPFC, and both methods share comparable complexity across fixed-confidence and fixed-budget regimes. The results provide concrete, provable guarantees for efficiently eliciting the desired signaling rule in dynamic information-acquisition environments.

Abstract

-scoring rule with a efficient instance-dependent sample complexity or an instance-independent sample complexity. Our analysis also shows that OIAFB matches the instance-independent performance bound of OIAFC, while both algorithms share the same complexity across fixed confidence and fixed budget settings.

Provably Efficient Algorithm for Best Scoring Rule Identification in Online Principal-Agent Information Acquisition

TL;DR

Abstract

Provably Efficient Algorithm for Best Scoring Rule Identification in Online Principal-Agent Information Acquisition

TL;DR

Abstract

Paper Structure

Table of Contents

Key Result

Theorems & Definitions (15)