Pay for The Second-Best Service: A Game-Theoretic Approach Against Dishonest LLM Providers
Yuhan Cao, Yu Wang, Sitong Liu, Miao Li, Yixin Tao, Tianxing He
TL;DR
We address the vulnerability of dishonest LLM API providers by casting user-provider interactions as a sequential delegation game and designing a mechanism that mitigates deceit through incentive alignment. The authors prove an impossibility result for achieving first-best user utility and provide an $O(T^{1-\epsilon}\log T)$-approximate incentive-compatible mechanism that delivers near second-best user utility, implemented via four phases: exploration, exploitation, and two blind-trust stages. The theoretical results are complemented by simulations using real-world API settings, demonstrating robustness and near-optimal user utilities across scenarios. This work introduces a principled economic framework for securing LLM service markets and suggests practical directions for future research, including multi-user extensions and budget constraints.
Abstract
The widespread adoption of Large Language Models (LLMs) through Application Programming Interfaces (APIs) induces a critical vulnerability: the potential for dishonest manipulation by service providers. This manipulation can manifest in various forms, such as secretly substituting a proclaimed high-performance model with a low-cost alternative, or inflating responses with meaningless tokens to increase billing. This work tackles the issue through the lens of algorithmic game theory and mechanism design. We are the first to propose a formal economic model for a realistic user-provider ecosystem, where a user can iteratively delegate $T$ queries to multiple model providers, and providers can engage in a range of strategic behaviors. As our central contribution, we prove that for a continuous strategy space and any $ε\in(0,\frac12)$, there exists an approximate incentive-compatible mechanism with an additive approximation ratio of $O(T^{1-ε}\log T)$, and a guaranteed quasi-linear second-best user utility. We also prove an impossibility result, stating that no mechanism can guarantee an expected user utility that is asymptotically better than our mechanism. Furthermore, we demonstrate the effectiveness of our mechanism in simulation experiments with real-world API settings.
