Entropy-Based Strategies for Multi-Bracket Pools
Ryan S. Brill, Abraham J. Wyner, Ian J. Barnett
TL;DR
The paper addresses the challenge of generating multiple bracket predictions in high-dimensional parimutuel pools. It proposes an entropy-based, information-theoretically grounded approach that samples $n$ i.i.d. brackets from a parametric distribution, with performance improving as bracket entropy increases with $n$ and with opponent entropy. Through a canonical bitstring guessing example and real-world cases (pick six and March Madness), the authors show that optimizing bracket distribution entropy yields tractable solutions and practical gains, especially under favorable carryover or against diverse opponent strategies. The framework connects to the Asymptotic Equipartition Property and provides actionable guidance for designing multi-bracket strategies with scalable performance. The work lays a foundation for further refinement under unknown probabilities and more nuanced, round-wise entropy tuning.
Abstract
Much work in the parimutuel betting literature has discussed estimating event outcome probabilities or developing optimal wagering strategies, particularly for horse race betting. Some betting pools, however, involve betting not just on a single event, but on a tuple of events. For example, pick six betting in horse racing, March Madness bracket challenges, and predicting a randomly drawn bitstring each involve making a series of individual forecasts. Although traditional optimal wagering strategies work well when the size of the tuple is very small (e.g., betting on the winner of a horse race), they are intractable for more general betting pools in higher dimensions (e.g., March Madness bracket challenges). Hence we pose the multi-brackets problem: supposing we wish to predict a tuple of events and that we know the true probabilities of each potential outcome of each event, what is the best way to tractably generate a set of $n$ predicted tuples? The most general version of this problem is extremely difficult, so we begin with a simpler setting. In particular, we generate $n$ independent predicted tuples according to a distribution having optimal entropy. This entropy-based approach is tractable, scalable, and performs well.
