General measures of effect size to calculate power and sample size for Wald tests with generalized linear models
Amy L Cochran, Shijie Yuan, Paul J Rathouz
TL;DR
This work tackles the challenge of power and sample size calculations for Wald tests in GLMs with multiple predictors and adjustors, where full distributional specification is often impractical. It introduces two general effect-size measures, $oldsymbol{ extphi}_{x|z}$ and $R^2_{x|z}$, grounded in first- and second-moment information, to approximate the noncentrality parameter essential for power calculations. The authors derive approximation bounds, explore asymptotic local-alternative behavior, and validate the approach via simulations across common GLMs and a real-case study on education and mental health treatment. The framework extends linear-regression concepts (partial $R^2$, Cohen’s d) to GLMs, enabling more flexible and interpretable PSS planning across diverse models, though its accuracy hinges on predictor-adjustor variance and distributional features. This work has practical implications for study design, offering a tractable route to power analysis in complex GLM settings without full joint-distribution requirements.
Abstract
Power and sample size calculations for Wald tests in generalized linear models (GLMs) are often limited to specific cases like logistic regression. More general methods typically require detailed study parameters that are difficult to obtain during planning. We introduce two new effect size measures for estimating power and sample size in studies using Wald tests across any GLM. These measures accommodate any number of predictors or adjusters and require only basic study information. We provide practical guidance for interpreting and applying these measures to approximate a key parameter in power calculations. We also derive asymptotic bounds on the relative error of these approximations, showing that accuracy depends on features of the GLM such as the nonlinearity of the link function. To complement this analysis, we conduct simulation studies across common model specifications, identifying best use cases and opportunities for improvement. Finally, we test the methods in finite samples to confirm their practical utility, using a case study on the relationship between education and receipt of mental health treatment.
