Introducing Grid WAR: Rethinking WAR for Starting Pitchers
Ryan S. Brill, Abraham J. Wyner
TL;DR
The paper critiques standard WAR for starting pitchers as overly reliant on season-long averages and sequencing-agnostic metrics, proposing Grid WAR (GWAR) as a per-game, convex, context-neutral valuation. GWAR combines a grid-based context-neutral win probability $f(I,R)$ with a mid-inning continuation function $g(r|S,O)$, a replacement baseline $w_{rep}$, and park effects $oldsymbol{eta}^{(park)}$, with parameters estimated via a Poisson Empirical Bayes model and ridge park factors. Empirical results reveal that GWAR reweights pitcher contributions, tends to upweight high-variance performances, and provides better predictive validity for future GWAR than traditional FWAR-based estimates, supporting the view that game-by-game variance contains systematic signal. An online Shiny app at gridwar.xyz hosts per-game, per-season, and per-career GWAR results, offering a new, more nuanced lens on pitcher valuation and historical comparisons.
Abstract
The baseball statistic "Wins Above Replacement" (WAR) has emerged as one of the most popular evaluation metrics. But it is not readily observed and tabulated; WAR is an estimate of a parameter in a vaguely defined model with all its attendant assumptions. Industry-standard models of WAR for starting pitchers from FanGraphs and Baseball Reference all assume that season-long averages are sufficient statistics for a pitcher's performance. This provides an invalid mathematical foundation for many reasons, especially because WAR should not be linear with respect to any counting statistic. To repair this defect, as well as many others, we devise a new measure, Grid WAR, which accurately estimates a starting pitcher's WAR on a per-game basis. The convexity of Grid WAR diminishes the impact of "blow-up" games and upweights exceptional games, raising the valuation of pitchers like Sandy Koufax, Whitey Ford, and Catfish Hunter who exhibit fundamental game-by-game variance. Grid WAR is designed to accurately measure past performance, but also has predictive value insofar as a pitcher's Grid WAR is better than WAR at predicting future performance. Finally, at https://gridwar.xyz we host a Shiny app which displays the Grid WAR results of each MLB game since 1952, including career, season, and game level results, which updates automatically every morning.
