Table of Contents
Fetching ...

Computational Arbitrage in AI Model Markets

Ricardo Olmedo, Bernhard Schölkopf, Moritz Hardt

Abstract

Consider a market of competing model providers selling query access to models with varying costs and capabilities. Customers submit problem instances and are willing to pay up to a budget for a verifiable solution. An arbitrageur efficiently allocates inference budget across providers to undercut the market, thus creating a competitive offering with no model-development risk. In this work, we initiate the study of arbitrage in AI model markets, empirically demonstrating the viability of arbitrage and illustrating its economic consequences. We conduct an in-depth case study of SWE-bench GitHub issue resolution using two representative models, GPT-5 mini and DeepSeek v3.2. In this verifiable domain, simple arbitrage strategies generate net profit margins of up to 40%. Robust arbitrage strategies that generalize across different domains remain profitable. Distillation further creates strong arbitrage opportunities, potentially at the expense of the teacher model's revenue. Multiple competing arbitrageurs drive down consumer prices, reducing the marginal revenue of model providers. At the same time, arbitrage reduces market segmentation and facilitates market entry for smaller model providers by enabling earlier revenue capture. Our results suggest that arbitrage can be a powerful force in AI model markets with implications for model development, distillation, and deployment.

Computational Arbitrage in AI Model Markets

Abstract

Consider a market of competing model providers selling query access to models with varying costs and capabilities. Customers submit problem instances and are willing to pay up to a budget for a verifiable solution. An arbitrageur efficiently allocates inference budget across providers to undercut the market, thus creating a competitive offering with no model-development risk. In this work, we initiate the study of arbitrage in AI model markets, empirically demonstrating the viability of arbitrage and illustrating its economic consequences. We conduct an in-depth case study of SWE-bench GitHub issue resolution using two representative models, GPT-5 mini and DeepSeek v3.2. In this verifiable domain, simple arbitrage strategies generate net profit margins of up to 40%. Robust arbitrage strategies that generalize across different domains remain profitable. Distillation further creates strong arbitrage opportunities, potentially at the expense of the teacher model's revenue. Multiple competing arbitrageurs drive down consumer prices, reducing the marginal revenue of model providers. At the same time, arbitrage reduces market segmentation and facilitates market entry for smaller model providers by enabling earlier revenue capture. Our results suggest that arbitrage can be a powerful force in AI model markets with implications for model development, distillation, and deployment.
Paper Structure (30 sections, 9 equations, 12 figures, 1 table)

This paper contains 30 sections, 9 equations, 12 figures, 1 table.

Figures (12)

  • Figure 1: Consider a model market with three providers: GPT-5 mini, DeepSeek v3.2, and an arbitrageur. Consumers demand a target level of performance on SWE-bench–type tasks; for example, a 75% SWE-bench solve rate. Through repeated sampling, GPT-5 mini and DeepSeek each achieve a 75% SWE-bench solve rate at costs of $150 and $120, respectively. The arbitrageur instead sources generations by first querying GPT-5 mini (at up to $0.08 per problem) and, if that fails, querying DeepSeek. Using this strategy, the arbitrageur attains the same 75% solve rate at a cost of $80. This cost advantage creates a profit opportunity: the arbitrageur can resell its sourced generations at markups of up to 50% while still undercutting the market.
  • Figure 2: Inference cost for GPT-5 mini and DeepSeek v3.2 to reach different SWE-bench performance levels, with inference budgets scaled through repeated sampling (up to $1 per issue). We also evaluate the following arbitrage policy: allocate up to $0.08 to GPT-5 mini and, if it fails, spend the remaining $0.92 on DeepSeek. The arbitrage policy (red) achieves solve rates above 68% at a lower cost than either GPT-5 mini or DeepSeek. This cost advantage enables the arbitrageur to profit by reselling its generations close to market price (purple).
  • Figure 3: Two arbitrageurs deploy the same arbitrage policy but compete on price. They take turns updating their prices to undercut each other. Earlier turns are plotted with greater transparency. Left: Competition between arbitrageurs drives market prices downward. In equilibrium, the market price equals the arbitrageurs' buy price. Right: While arbitrage is initially highly profitable, profit opportunities eventually vanish.
  • Figure 4: Revenue split across model providers for different levels of performance. Left: In the absence of arbitrageurs, the market is segmented by performance, with a single model dominating each segment. Middle: Arbitrageurs eliminate this segmentation, allowing both models to earn revenue across a much broader range of levels of performance. Right: Arbitrageurs reduce providers’ marginal revenue, with the lost surplus transferred to arbitrageur profits or passed on to consumers as lower market prices.
  • Figure 5: Profit margin across different search budgets when fitting the arbitrage policy. The solid line represents mean profitability, whereas the shaded area indicates the 95% confidence interval, computed by bootstrapping over the samples acquired within the search budget. Left: When fitting a fixed query distribution, small search budgets (e.g., $10) consistently yield profitable arbitrage policies. Middle and right: We fit the arbitrageur either on software issues from the Django library or on issues from other repositories, and evaluate the resulting arbitrage policy on the held-out data. We find that, on expectation, the arbitrageur remains profitable under such distribution shifts in the query distribution.
  • ...and 7 more figures

Theorems & Definitions (1)

  • Definition 2.1: Arbitrage Opportunity