A Counterfactual Analysis of the Dishonest Casino
Martin Haugh, Raghav Singal
TL;DR
This paper addresses how to attribute winnings in a two-state hidden Markov model (the dishonest casino) to cheating through a counterfactual lens. It embeds the HMM in a structural causal model and derives sharp, linear-programming bounds (EWAC) for the counterfactual winnings attributable to cheating, leveraging a marginals-only formulation over a copula-parameterized joint emission matrix. Time-homogeneity tightens these bounds, while time-inhomogeneity yields separable LPs with analytic solutions; crucially, the time-averaged EWAC becomes fully identifiable as the horizon grows. The work contributes a first-of-its-kind LP-based framework for counterfactual bounds in dynamic latent-state models and offers a tractable educational tool for illustrating partial identification and the role of domain knowledge in causal inference.
Abstract
The dishonest casino is a well-known hidden Markov model (HMM) often used in education to introduce HMMs and graphical models. A sequence of die rolls is observed with the casino switching between a fair and a loaded die. Instead of recovering the latent regime through filtering, smoothing, or the Viterbi algorithm, we ask a counterfactual question: how much of the gambler's winnings are caused by the casino's cheating? We introduce a class of structural causal models (SCMs) consistent with the HMM and define the expected winnings attributable to cheating (EWAC). Because EWAC is only partially identifiable, we bound it via linear programs (LPs). Numerical experiments help to develop intuition using benchmark SCMs based on independence, comonotonic, and countermonotonic copulas. Imposing a time homogeneity condition on the SCM yields tighter bounds, whereas relaxing it produces looser bounds that admit an explicit LP solution. Domain knowledge such as pathwise monotonicity or counterfactual stability can be incorporated through additional linear constraints. Finally, we show the time-averaged EWAC becomes fully identifiable as the number of time periods tends to infinity. Our work is the first to develop LP bounds for counterfactuals in an HMM setting, benefiting educational contexts where counterfactual inference is taught.
