Identifiability and Falsifiability: Two Challenges for Bayesian Model Expansion
Collin Cademartori
TL;DR
This work analyzes how Bayesian model expansion impacts identifiability and falsifiability of inferences, introducing identifiability mutual information (IMI) and falsifiability mutual information (FMI) as principled proxies. A central result shows that, when an expanded model becomes sufficiently complex, there is a trade-off: improving identifiability can come at the cost of weaker falsifiability, or vice versa, unless prior structure constrains expansion. The authors formalize this via an information-theoretic framework that includes an expansion ratio and a contraction coefficient, and they illustrate the theory with three worked examples: linear regression, unknown variance, and hierarchical priors. They also discuss practical implications for prior design, such as using prior expansions or penalized complexity priors to limit the detrimental effects of expansion on IMI and FMI, while acknowledging limitations related to computation and misspecification. Overall, the paper clarifies when and how model expansion can be advantageous or risky, and provides guidance for constructing expansions that preserve inferential and predictive validity.
Abstract
We study the identifiability of parameters and falsifiability of predictions under the process of model expansion in a Bayesian setting. Identifiability is represented by the closeness of the posterior to the prior distribution and falsifiability by the power of posterior predictive tests against alternatives. To study these two concepts formally, we develop information-theoretic proxies, which we term the identifiability and falsifiability mutual information. We argue that these are useful indicators, with lower values indicating a risk of poor parameter inference and underpowered model checks, respectively. Our main result establishes that a sufficiently complex expansion of a base statistical model forces a trade-off between these two mutual information quantities -- at least one of the two must decrease relative to the base model. We illustrate our result in three worked examples and extract implications for model expansion in practice. In particular, we show as an implication of our result that the negative impacts of model expansion can be limited by offsetting complexity in the likelihood with sufficiently constraining prior distributions.
