Assessing the impact of variance heterogeneity and misspecification in mixed-effects location-scale models
Vincent Jeanselme, Marco Palma, Jessica K Barrett
TL;DR
This paper tackles the problem of variance heterogeneity in longitudinal analyses by evaluating mixed-effects location-scale models (MELSM) as an alternative to traditional linear mixed models (LMM). Through a comprehensive simulation study based on PBC covariates, it demonstrates that ignoring heteroscedasticity in LMMs leads to undercoverage and biased variance components, while MELSM can recover location estimates more reliably when the scale is correctly specified. The results reveal nuanced effects of misspecification: scale misspecification hurts the precision of location estimates, location misspecification biases scale estimates, and there is a dynamic interplay between random effects and residual variance. The case study on the PBC dataset reinforces the practical value of MELSM for detecting heteroscedasticity and informs methodological guidance for extending these ideas to GLMMs and joint models in survival settings.
Abstract
Linear Mixed Model (LMM) is a common statistical approach to model the relation between exposure and outcome while capturing individual variability through random effects. However, this model assumes the homogeneity of the error term's variance. Breaking this assumption, known as homoscedasticity, can bias estimates and, consequently, may change a study's conclusions. If this assumption is unmet, the mixed-effect location-scale model (MELSM) offers a solution to account for within-individual variability. Our work explores how LMMs and MELSMs behave when the homoscedasticity assumption is not met. Further, we study how misspecification affects inference for MELSM. To this aim, we propose a simulation study with longitudinal data and evaluate the estimates' bias and coverage. Our simulations show that neglecting heteroscedasticity in LMMs leads to loss of coverage for the estimated coefficients and biases the estimates of the standard deviations of the random effects. In MELSMs, scale misspecification does not bias the location model, but location misspecification alters the scale estimates. Our simulation study illustrates the importance of modelling heteroscedasticity, with potential implications beyond mixed effect models, for generalised linear mixed models for non-normal outcomes and joint models with survival data.
