Assessing win strength in MLB win prediction models
Morgan Allen, Paul Savala
TL;DR
The paper investigates how predicted win probabilities from a broad set of MLB win-prediction models relate to actual score differentials, framing win-strength as a measurable attribute. By training on 2001–2015 data and testing on 2016–2019 data, it compares six model families plus a FiveThirtyEight baseline using AUROC, log-loss, and Brier score, and finds that most models outperform the baseline with logistic regression often delivering top overall performance. The study further demonstrates that predicted win probabilities correlate positively with score differentials on average, and that a targeted run-line betting strategy using probabilistic cutoffs can yield positive returns. Overall, the work highlights meaningful linkages between win likelihood and win strength, while offering practical insights for betting strategies and model ensemble design.
Abstract
In Major League Baseball, strategy and planning are major factors in determining the outcome of a game. Previous studies have aided this by building machine learning models for predicting the winning team of any given game. We extend this work by training a comprehensive set of machine learning models using a common dataset. In addition, we relate the win probabilities produced by these models to win strength as measured by score differential. In doing so we show that the most common machine learning models do indeed demonstrate a relationship between predicted win probability and the strength of the win. Finally, we analyze the results of using predicted win probabilities as a decision making mechanism on run-line betting. We demonstrate positive returns when utilizing appropriate betting strategies, and show that naive use of machine learning models for betting lead to significant loses.
