Hamiltonian Monte Carlo Inference of Marginalized Linear Mixed-Effects Models
Jinlin Lai, Justin Domke, Daniel Sheldon
TL;DR
This work presents a principled method to analytically marginalize random effects in linear mixed-effects models within Hamiltonian Monte Carlo, leveraging fast linear-algebra techniques (matrix inversion and determinant lemmas) and the tree-structured design matrices to avoid cubic-time computations. By reducing the latent dimensionality and preserving exploitable structure, the approach substantially improves sampling efficiency and runtime across diverse LMMs, especially in cognitive-science datasets, and can be implemented in probabilistic programming frameworks like NumPyro. The authors extend the method to multiple effects under scaled-identity covariance assumptions with feasible preprocessing and per-sample costs, compare marginalization to non-centered parameterizations, and demonstrate substantial practical gains through extensive experiments including cross-effects, vectorization benefits, and cognitive-science applications. The results suggest that marginalizing applicable random effects should be standard practice in Bayesian LMM inference, with broad implications for PPL workflows and scalable hierarchical modeling.
Abstract
Bayesian reasoning in linear mixed-effects models (LMMs) is challenging and often requires advanced sampling techniques like Markov chain Monte Carlo (MCMC). A common approach is to write the model in a probabilistic programming language and then sample via Hamiltonian Monte Carlo (HMC). However, there are many ways a user can transform a model that make inference more or less efficient. In particular, marginalizing some variables can greatly improve inference but is difficult for users to do manually. We develop an algorithm to easily marginalize random effects in LMMs. A naive approach introduces cubic time operations within an inference algorithm like HMC, but we reduce the running time to linear using fast linear algebra techniques. We show that marginalization is always beneficial when applicable and highlight improvements in various models, especially ones from cognitive sciences.
