The Price of Progress: Algorithmic Efficiency and the Falling Cost of AI Inference
Hans Gundlach, Jayson Lynch, Matthias Mertens, Neil Thompson
TL;DR
The paper addresses the real-world efficiency of AI inference by measuring how much it costs to achieve a given benchmark performance. Using a large dataset of benchmark prices from Epoch AI and Internet Archive, the authors apply regression analyses to quantify price trends, distinguishing frontier (open- and closed-weight) models from the broader model set. They find frontier models offer price-perf improvements of about 5-10x per year, with algorithmic progress (hardware-adjusted) around 3x per year for open-weight models, while benchmarking costs often rise or stay flat, offsetting some gains. The study argues for transparent reporting of computational resources in evaluations to better reflect practical impact and guide future benchmarking practices.
Abstract
Language models have seen enormous progress on advanced benchmarks in recent years, but much of this progress has only been possible by using more costly models. Benchmarks may therefore present a warped picture of progress in practical capabilities per dollar. To remedy this, we use data from Artificial Analysis and Epoch AI to form the largest dataset of current and historical prices to run benchmarks to date. We find that the price for a given level of benchmark performance has decreased remarkably fast, around $5\times$ to $10\times$ per year, for frontier models on knowledge, reasoning, math, and software engineering benchmarks. These reductions in the cost of AI inference are due to economic forces, hardware efficiency improvements, and algorithmic efficiency improvements. Isolating out open models to control for competition effects and dividing by hardware price declines, we estimate that algorithmic efficiency progress is around $3\times$ per year. Finally, we recommend that evaluators both publicize and take into account the price of benchmarking as an essential part of measuring the real-world impact of AI.
