Multilevel Metamodels: Enhancing Inference, Interpretability, and Generalizability in Monte Carlo Simulation Studies
Joshua Gilbert, Luke Miratrix
TL;DR
The paper addresses how to robustly summarize Monte Carlo simulation results when multiple models are applied to the same simulated data. It advocates multilevel metamodels (MLMMs) to account for dependence across models and to enable generalizability assessments via random effects and empirical Bayes ranges. Through a running covariate-adjustment example in RCTs, the authors demonstrate that two-level MLMMs improve precision over OLS for aggregated results, while three-level MLMMs extending to individual results provide richer insights into consistency across simulation conditions and generalizability. The work highlights practical benefits for interpreting simulation findings, guiding model choice, and informing statistical practice, while noting limitations and avenues for future research, including Bayesian approaches and preregistration.
Abstract
Metamodels, or the regression analysis of Monte Carlo simulation results, provide a powerful tool to summarize simulation findings. However, an underutilized approach is the multilevel metamodel (MLMM) that accounts for the dependent data structure that arises from fitting multiple models to the same simulated data set. In this study, we articulate the theoretical rationale for the MLMM and illustrate how it can improve the interpretability of simulation results, better account for complex simulation designs, and provide new insights into the generalizability of simulation findings.
